The Open University 


M381 Number Theory and 
Mathematical Logic 




Number Theory Unit 8 


Diophantine Equations 


Prepared for the Course Team by Alan Best 




The M381 Number Theory Course Team 


The Number Theory half of the course was produced by the following team: 
Alan Best Author 


Andrew Brown Course Team Chair and Academic Editor 
Roberta Cheriyan Course Manager 

Bob Coates Critical Reader 

Dick Crabbe Publishing Editor 

Janis Gilbert Graphic Artist 

Derek Goldrei Critical Reader 

Caroline Husher Graphic Designer 

John Taylor Graphic Artist 

with valuable assistance from: 

CMPU Mathematics and Computing, Course Materials Production Unit 
John Bayliss Reader 

Elizabeth Best Reader 

Jeremy Gray History Reader 

Alison Neil Reader 


The external assessor was: 
Alex Wilkie Reader in Mathematical Logic, University of Oxford 


This publication forms part of an Open University course. Details of this and other Open University 
courses can be obtained from the Student Registration and Enquiry Service, The Open University, PO 
Box 197, Milton Keynes, MK7 6BJ, United Kingdom: tel. +44 (0)870 300 6090, e-mail 
general-enquiries@open.ac.uk 


Alternatively, you may visit the Open University website at http://www.open.ac.uk where you can 
learn more about the wide range of courses and packs offered at all levels by The Open University. 


To purchase a selection of Open University course materials, visit http://www.ouw.co.uk, or contact 
Open University Worldwide, Michael Young Building, Walton Hall, Milton Keynes, MK7 6AA, United 
Kingdom, for a brochure: tel. +44 (0)1908 858793, fax +44 (0)1908 858787, e-mail 
ouw-customer-services@open.ac.uk 


The Open University, Walton Hall, Milton Keynes, MK7 6AA. 
First published 1996. Reprinted 1997 and 2001. New edition 2007 with corrections. 
Copyright © 1996 The Open University 


All rights reserved; no part of this publication may be reproduced, stored in a retrieval system, 
transmitted or utilised in any form or by any means, electronic, mechanical, photocopying, 
recording or otherwise, without written permission from the publisher or a licence from the 
Copyright Licensing Agency Ltd. Details of such licences (for reprographic reproduction) may be 
obtained from the Copyright Licensing Agency Ltd, Saffron House, 6-10 Kirby Street, London 
EC1N 8TS; website http://www.cla.co.uk.


Open University course materials may also be made available in electronic formats for use by 
students of the University. All rights, including copyright and related rights and database rights, in 
electronic course materials and their contents are owned by or licensed to The Open University, or 
otherwise used by The Open University as permitted by applicable law. 


In using electronic course materials and their contents you agree that your use will be solely for the 
purposes of following an Open University course of study or otherwise as licensed by The Open 
University or its assigns. 


Except as permitted above you undertake not to copy, store in any medium (including electronic 
storage or use in a website), distribute, transmit or re-transmit, broadcast, modify or show in public 
such electronic materials in whole or in part without the prior written consent of The Open 
University or in accordance with the Copyright, Designs and Patents Act 1988. 


Edited, designed and typeset by The Open University, using the Open University TEX System. 
Printed and bound in the United Kingdom by The Charlesworth Group, Wakefield. 

ISBN 978 0 7492 2278 9 

2.1 


CONTENTS

Introduction

1 Pell’s Equation
1.1 Pell’s equation and continued fractions
1.2 Solution of Pell’s equation
1.3 Solutions from the fundamental solution

2 The Pythagorean Equation
2.1 Primitive Pythagorean triples
2.2 Special Pythagorean triples

3 Fermat’s Last Theorem
3.1 Some history of the Last Theorem
3.2 The equation $x^4 + y^4 = z^4$
3.3 Related Diophantine equations

4 Sums of Squares
4.1 Representing primes as sums of two squares
4.2 Sums of two squares, completed
4.3 Sums of three and four squares

Additional Exercises

Solutions to the Problems

Solutions to Additional Exercises

Index



INTRODUCTION 


A Diophantine equation is an equation in two or more variables which is to 
be solved within the set of integers. Diophantus was the first to consider 
such matters, his interest stemming from geometrical problems involving 
squares and cubes. Diophantus was only concerned with finding one solution 
rather than all solutions, and he was content with the solutions being 
rational numbers rather than integers. Nevertheless it is fitting that this 
immense branch of number theory is named after him. 


Practically nothing is known of Diophantus the person. He lived in 
Alexandria around 250 AD and wrote in Greek, although there is a 
suggestion that he might have been of Babylonian origin. His fame rests on 
his Arithmetica; it is believed that this consisted of thirteen books but only 
six have survived. In addition to posing many of the famous problems in 
number theory which will occupy our attention in this unit, Arithmetica 
presents the first real treatise of algebra introducing, as it does, 
revolutionary mathematical notation and symbolism. 


For fourteen or so centuries after Diophantus, little advance of any 
significance was made on general methods of solving Diophantine equations. 
Then Fermat arrived on the scene, his contributions in this area being 
regarded by many as the real origins of modern number theory. 


Our first encounter with a Diophantine equation was in Unit 1 where we 
considered the linear equation $ax + by = c$. Thinking geometrically this
equation represents a line in the Cartesian plane. To solve it as a 
Diophantine equation requires finding all (if any) lattice points through 
which the line passes. There are a number of questions which we can ask, 
and have asked, of a linear Diophantine equation. 


• Are there any integer solutions?
• Is the set of integer solutions finite?
• Is there a method of systematically finding all solutions?


We had little difficulty supplying answers to all three questions in the case of
the linear Diophantine equation. But the same questions can be asked of
any polynomial equation $P(x_1, x_2, \ldots, x_n) = 0$ in n variables and, more
often than not, answering these questions presents considerable difficulty,
even when n = 2 and P is a fairly simple polynomial. For example, consider
the two variable equation $x^3 - y^2 = 2$. With a little trial and error you
might be able to spot one solution and so answer the first of the above
questions, but you would find the other two questions much more daunting.
(Fermat managed them but we shall not go into his solutions here.)


In this unit we shall concentrate on a few of the more famous Diophantine 
equations. We begin where we left the previous unit, looking at applications 
of continued fractions. 


Recall that a lattice point is one 
with integer coordinates. 


1 PELL’S EQUATION 


1.1 Pell’s equation and continued fractions 


Being unable to interest his contemporaries in his researches in number 
theory, Fermat took to issuing challenges to Europe’s best mathematicians, 
with England’s John Wallis being a prime target. Some of Fermat’s 
unpublished discoveries came to light via these arithmetic challenges. One of 
his first posers was the following from 1657. 


Given a non-square positive integer n, find an integer y such that 
$ny^2 + 1$ is also a square. If a general rule cannot be discovered find the
smallest values of y for the cases n = 61 and n = 109. 


The underlying problem here is to solve the Diophantine equation
$x^2 - ny^2 = 1$ for a general n, with the subsidiary challenge of finding
particular solutions for the cases n = 61 and n = 109. (The trivial solution
x = 1, y = 0 is discounted, the real task being to find solutions in positive
integers.) Wallis together with his patron, Viscount Brouncker (who was the
first president of the Royal Society), discovered a general method of solution,
though they were unable to prove that their method always works. However
the choice of posed special cases, n = 61 and 109, leaves no doubt that
Fermat too was aware of some way of solving the problem. The values of y
in the smallest solutions turn out to be y = 226 153 980 for n = 61 and
y = 15 140 424 455 100 for n = 109, and these solutions were not likely to be
found by trial and error! In contrast, for the adjacent values n = 60, 62, 108
and 110 the y values in the smallest solutions turn out to be y = 4, 8, 130
and 2 respectively.
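
The contrast between these cases is easy to see by experiment. The following Python sketch is ours, not part of Fermat's or Wallis's work; the function names `smallest_y` and `gives_square` are hypothetical. It searches by trial for the smallest y making $ny^2 + 1$ a square, and separately checks the two large values quoted above, which trial and error would never reach in reasonable time.

```python
from math import isqrt


def gives_square(n, y):
    """True if n*y^2 + 1 is a perfect square."""
    t = n * y * y + 1
    r = isqrt(t)
    return r * r == t


def smallest_y(n, limit=10**6):
    """Smallest y with 1 <= y < limit for which n*y^2 + 1 is a square.

    Plain trial and error; returns None if nothing is found below `limit`.
    """
    for y in range(1, limit):
        if gives_square(n, y):
            return y
    return None


# The four neighbouring values of n are settled almost instantly.
for n in (60, 62, 108, 110):
    print(n, smallest_y(n))

# The quoted values for n = 61 and n = 109 can be checked, though not found, this way.
print(gives_square(61, 226_153_980))
print(gives_square(109, 15_140_424_455_100))
```

Python integers have arbitrary precision, so the check for n = 109 involves no rounding.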


Fermat’s posed problem was by no means the first appearance of the
Diophantine equation $x^2 - ny^2 = 1$. The ancient Greeks had been
considering the cases n = 2 and n = 3 in searching for rational
approximations to $\sqrt{2}$ and $\sqrt{3}$ respectively. (We saw how to find these in Unit 7.)
The equation for the case n = 2 also arises in the search for numbers which
are both triangular and square. (We shall solve the triangular square problem
in this section.)

A famous problem posed by Archimedes, concerning breeds of cattle on the
island of Sicily, was reduced by elementary algebra to the task of finding
positive integer solutions of $x^2 - 4\,729\,494y^2 = 1$. The smallest solution
turns out to have y as an integer of 41 digits and, not surprisingly, it was
not known to the Greeks!


In 1759 Euler showed that any solution of $x^2 - ny^2 = 1$ must necessarily
have $x/y$ as a convergent of $\sqrt{n}$ and he went on to discover a general method
of solution based on the continued fraction of $\sqrt{n}$. His paper contained all
that was needed to show that the Diophantine equation $x^2 - ny^2 = 1$ has
infinitely many solutions and that all of them are obtainable from the
continued fraction of $\sqrt{n}$, although he failed to collate them into a complete
proof. Lagrange provided this in 1768.


The equation $x^2 - ny^2 = 1$ is known as Pell’s equation. So where does Pell
enter the story? It transpires that Euler wrongly attributed the first method
of solution by Wallis and Brouncker to the English mathematician John Pell 
(1611-1685), and although this error is now recognized, the name of Pell has 
remained firmly attached to this Diophantine equation. 


So much for the background; let us now make a start at solving this
equation. If n is a non-square positive integer then $\sqrt{n}$ is irrational and so
the equation $x^2 - ny^2 = 0$ (or $x^2/y^2 = n$) has no solutions in positive
integers. But if $x/y$ is a good rational approximation to $\sqrt{n}$ then $x^2/y^2$ is
close to n or, what amounts to the same thing, $x^2 - ny^2 = k$ where k is a small
(positive or negative) integer.

As k cannot equal 0 the next best thing is that $k = \pm 1$. Having seen that
the convergents are the ‘best’ rational approximations to $\sqrt{n}$, it will
therefore come as no surprise to discover that every positive solution of
$x^2 - ny^2 = 1$ arises as a convergent $x/y$ of $\sqrt{n}$. The groundwork for proving
this has been done in the previous unit.

If x = a, y = b is a solution then, for example, x = −a, y = b is another
solution. We shall confine attention to positive solutions.


Theorem 1.1 Solutions of Pell’s equation are convergents 


If x = a, y = b is a positive solution of $x^2 - ny^2 = 1$ then $a/b$ is a
convergent of $\sqrt{n}$.


Proof of Theorem 1.1 


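Making use of Theorem 4.3 of Unit 7, we aim to show that if $a^2 - nb^2 = 1$
then
$$\left| \sqrt{n} - \frac{a}{b} \right| < \frac{1}{2b^2},$$
which is sufficient to ensure that $a/b$ is a convergent of $\sqrt{n}$.

As $a^2 - nb^2 = 1$ we have $(a - b\sqrt{n})(a + b\sqrt{n}) = 1$ and so
$$a - b\sqrt{n} = \frac{1}{a + b\sqrt{n}} > 0 \quad\text{and}\quad a > b\sqrt{n}.$$

Therefore
$$\left| \sqrt{n} - \frac{a}{b} \right| = \frac{a - b\sqrt{n}}{b} = \frac{1}{b(a + b\sqrt{n})} < \frac{1}{b(b\sqrt{n} + b\sqrt{n})} = \frac{1}{2b^2\sqrt{n}} < \frac{1}{2b^2}.$$

Hence $a/b$ is a convergent of $\sqrt{n}$. ∎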


Knowing that all the solutions of Pell’s equation are to be found amongst
the convergents of $\sqrt{n}$ it remains to identify which, if any, of the convergents
give rise to solutions. In the example below we have taken the case n = 7
and worked out a few of the convergents $p_k/q_k$ of $\sqrt{7}$. We have then calculated
the corresponding values of $p_k^2 - 7q_k^2$ in the hope that the value 1 might turn
up.


Example 1.1 


Determine the convergents of $\sqrt{7}$ up to $C_{10}$ and find which of these give rise
to a solution of $x^2 - 7y^2 = 1$.

The continued fraction of $\sqrt{7}$ is [2, (1, 1, 1, 4)]. Hence, using the tabular
method to determine the convergents, we have the following.

k                  1    2    3    4    5    6    7    8    9   10
a_k                2    1    1    1    4    1    1    1    4    1
p_k                2    3    5    8   37   45   82  127  590  717
q_k                1    1    2    3   14   17   31   48  223  271
p_k^2 - 7q_k^2    -3    2   -3    1   -3    2   -3    1   -3    2


This reveals two solutions to this Pell’s equation, namely

x = 8, y = 3 as $8^2 - 7 \times 3^2 = 1$,

x = 127, y = 48 as $127^2 - 7 \times 48^2 = 1$.

However there are also the beginnings of a clear pattern in the values for
$p_k^2 - 7q_k^2$. It looks as if the sequence of values −3, 2, −3, 1 is cycling. Could
it be that $C_4$ and $C_8$ and every fourth convergent thereafter will give a
solution?
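
The calculation in Example 1.1 is mechanical and can be sketched in Python. The code below is an illustration under our own naming (`sqrt_partial_quotients` and `convergents` are hypothetical helpers, not taken from the course). The partial quotients of $\sqrt{n}$ are produced by the usual integer recurrence for quadratic irrationals, which the text does not spell out, and the convergents by the recurrence $p_k = a_k p_{k-1} + p_{k-2}$, $q_k = a_k q_{k-1} + q_{k-2}$ underlying the tabular method.

```python
from math import isqrt


def sqrt_partial_quotients(n, count):
    """First `count` partial quotients a_1, a_2, ... of sqrt(n), n a non-square."""
    a1 = isqrt(n)
    # The complete quotient at each stage is (m + sqrt(n)) / d, with m, d integers.
    m, d, a = 0, 1, a1
    quotients = [a1]
    while len(quotients) < count:
        m = d * a - m
        d = (n - m * m) // d
        a = (a1 + m) // d
        quotients.append(a)
    return quotients


def convergents(quotients):
    """Convergents C_1, C_2, ... returned as pairs (p_k, q_k)."""
    p_prev, q_prev = 1, 0      # conventional starting column of the table
    p, q = quotients[0], 1     # C_1 = a_1 / 1
    result = [(p, q)]
    for a in quotients[1:]:
        p, p_prev = a * p + p_prev, p
        q, q_prev = a * q + q_prev, q
        result.append((p, q))
    return result


n = 7
for k, (p, q) in enumerate(convergents(sqrt_partial_quotients(n, 10)), start=1):
    print(k, p, q, p * p - n * q * q)
```

Changing `n` and the number of convergents lets the same experiment be repeated for other non-square values.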


Problem 1.1 


(a) Determine the convergents of $\sqrt{3}$ = [1, (1, 2)] as far as $C_{10}$ and check
which of these satisfy $p^2 - 3q^2 = 1$.

(b) In the same way, find three convergents of $\sqrt{10}$ = [3, (6)] which give
solutions of $x^2 - 10y^2 = 1$.


Any lingering hope that each convergent would give rise to a solution of the
corresponding Pell’s equation has now been dispelled. On the positive side,
however, for n = 3, 7 and 10, the convergents have, in each case, led to at
least one solution. There are also encouraging signs from the row of $p^2 - nq^2$
in each of the constructed tables; the values here, like the partial quotients
in the ICF of $\sqrt{n}$, appear to cycle. But the key observation to make from
these examples concerns where the solutions arise. For n = 3 and for n = 7
the solutions of Pell’s equation arise from the convergents corresponding to
the penultimate partial quotient in the cycle. For example, the cycle in the
continued fraction of $\sqrt{7}$ ends in a 4 and it is the convergents calculated from
the partial quotient 1 immediately prior to this 4 which give the solutions.


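Figure 1.1 Solving Pell’s equation for the cases n = 3 and n = 7. (The figure marks the penultimate partial quotient of the cycle in each of $\sqrt{3}$ = [1, (1, 2)] and $\sqrt{7}$ = [2, (1, 1, 1, 4)]: solutions to Pell’s equation arise from convergents corresponding to this partial quotient.)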


The case n = 10 is similar, but Problem 1.1 part (b) gives more information.
As the cycle in the ICF of $\sqrt{10}$ has length 1, every partial quotient comes
immediately prior to the end of a cycle. But they do not all give rise to
solutions; it appears that the even convergents give rise to a solution of the
equation $x^2 - 10y^2 = 1$ while the odd convergents give rise to solutions of
$x^2 - 10y^2 = -1$. This time it looks as if the selected convergents all give rise
to solutions of $x^2 - 10y^2 = \pm 1$. We shall show that this is the case shortly.


1.2 Solution of Pell’s equation 


In the proof of the next theorem we need to make use of a result which we
gave without proof in Section 3.3 of Unit 7. This result states that, in
general, the ICF of $\sqrt{n}$ has the form
$$\sqrt{n} = [a_1, (a_2, a_3, \ldots, a_3, a_2, 2a_1)].$$
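
This symmetric form can be checked numerically for particular values of n. The sketch below is ours, with hypothetical function names `sqrt_cycle` and `has_symmetric_form`. It assumes the standard integer recurrence for the complete quotients $(m + \sqrt{n})/d$ of $\sqrt{n}$ and uses the fact, not proved in this unit, that the cycle closes at the first stage where d returns to 1.

```python
from math import isqrt


def sqrt_cycle(n):
    """Return (a_1, cycle) where sqrt(n) = [a_1, (cycle)], for non-square n."""
    a1 = isqrt(n)
    if a1 * a1 == n:
        raise ValueError("n must not be a perfect square")
    m, d, a = 0, 1, a1
    cycle = []
    while True:
        m = d * a - m
        d = (n - m * m) // d
        a = (a1 + m) // d
        cycle.append(a)
        if d == 1:
            # The complete quotient is a_1 + sqrt(n) again, so the cycle repeats.
            return a1, cycle


def has_symmetric_form(n):
    """Check that the cycle is a_2, a_3, ..., a_3, a_2, 2*a_1."""
    a1, cycle = sqrt_cycle(n)
    body = cycle[:-1]
    return cycle[-1] == 2 * a1 and body == body[::-1]


for n in (3, 7, 10, 14, 29):
    print(n, sqrt_cycle(n), has_symmetric_form(n))
```

A check of this kind confirms the form for as many values of n as one cares to try, but it is of course no substitute for the proof omitted in Unit 7.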


Let the number of partial quotients in the cycle of $\sqrt{n}$ be s. (We say that the
cycle length of the ICF of $\sqrt{n}$ is s.) Then, with the familiar notation $p_k/q_k$
for the convergents of $\sqrt{n}$,
$$\frac{p_s}{q_s} = [a_1, a_2, a_3, \ldots, a_3, a_2]; \qquad \frac{p_{2s}}{q_{2s}} = [a_1, a_2, a_3, \ldots, a_3, a_2, 2a_1, a_2, a_3, \ldots, a_3, a_2]$$
and, in general, $p_{rs}/q_{rs}$ is the convergent obtained by terminating in the rth
cycle immediately before the final partial quotient $2a_1$. We are now in a
position to state our main result.


Theorem 1.2 Solution of Pell’s equation 


If the continued fraction of $\sqrt{n}$ has cycle length s then
$$p_{rs}^2 - nq_{rs}^2 = (-1)^{rs}, \quad r = 1, 2, 3, \ldots,$$
and all solutions of
$$x^2 - ny^2 = \pm 1$$
are given in this way.


So, according to the theorem, as $\sqrt{14}$ = [3, (1, 2, 1, 6)] has cycle length s = 4,
the convergents $p_4/q_4$, $p_8/q_8$, $p_{12}/q_{12}$, ... satisfy $p^2 - 14q^2 = 1$. On the other hand as
$\sqrt{29}$ = [5, (2, 1, 1, 2, 10)] has odd cycle length s = 5, the convergents
$p_{10}/q_{10}$, $p_{20}/q_{20}$, $p_{30}/q_{30}$, ...
satisfy $p^2 - 29q^2 = 1$, while the convergents
$p_5/q_5$, $p_{15}/q_{15}$, $p_{25}/q_{25}$, ...
satisfy $p^2 - 29q^2 = -1$.
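
These two illustrations of Theorem 1.2 can be confirmed by direct calculation. The Python sketch below is ours (the names `convergents_of_sqrt` and `values_at_cycle_ends` are hypothetical); it takes the cycle length s as given, computes the convergents with the standard recurrences, and evaluates $p_{rs}^2 - nq_{rs}^2$ for r = 1, 2, 3.

```python
from math import isqrt


def convergents_of_sqrt(n, count):
    """Pairs (p_k, q_k), k = 1..count, for the convergents of sqrt(n)."""
    a1 = isqrt(n)
    m, d, a = 0, 1, a1                      # complete quotient (m + sqrt(n)) / d
    p_prev, q_prev, p, q = 1, 0, a1, 1      # C_1 = a_1 / 1
    out = [(p, q)]
    for _ in range(count - 1):
        m = d * a - m
        d = (n - m * m) // d
        a = (a1 + m) // d
        p, p_prev = a * p + p_prev, p
        q, q_prev = a * q + q_prev, q
        out.append((p, q))
    return out


def values_at_cycle_ends(n, s, cycles=3):
    """Values of p_rs^2 - n*q_rs^2 for r = 1..cycles, given the cycle length s."""
    conv = convergents_of_sqrt(n, s * cycles)
    values = []
    for r in range(1, cycles + 1):
        p, q = conv[r * s - 1]              # list index k - 1 holds C_k
        values.append(p * p - n * q * q)
    return values


print(values_at_cycle_ends(14, 4))   # even cycle length: [1, 1, 1]
print(values_at_cycle_ends(29, 5))   # odd cycle length: [-1, 1, -1]
```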
In what follows we are going to prove only the main part of the theorem,
namely that the listed convergents do indeed give solutions as claimed. We
shall omit proof of the converse, namely that all solutions of $x^2 - ny^2 = \pm 1$
arise in this way. It is not difficult to prove that any solution of
$x^2 - ny^2 = \pm 1$ must have $x/y$ as a convergent of $\sqrt{n}$. Rather than embark on
a messy proof here that it must be one of the claimed convergents, we shall
give, in Theorem 1.3, an alternative approach to obtaining ‘all’ the solutions.


Proof of Theorem 1.2 
For each $r \ge 1$ we can write $\sqrt{n}$ as the non-simple finite continued fraction
$$\sqrt{n} = [a_1, a_2, a_3, \ldots, a_3, a_2, 2a_1, a_2, a_3, \ldots, a_3, a_2, x]$$
with a total of rs + 1 partial quotients, the last of which, x, is not an
integer. In fact
$$x = [(2a_1, a_2, a_3, \ldots, a_3, a_2)] = a_1 + [a_1, (a_2, a_3, \ldots, a_3, a_2, 2a_1)] = a_1 + \sqrt{n}.$$
The final three convergents in the above finite continued fraction for $\sqrt{n}$ are
$$\frac{p_{rs-1}}{q_{rs-1}}, \quad \frac{p_{rs}}{q_{rs}} \quad\text{and}\quad \frac{x p_{rs} + p_{rs-1}}{x q_{rs} + q_{rs-1}}.$$
Now the last of these three is equal to $\sqrt{n}$ itself and so
$$\sqrt{n}\,(x q_{rs} + q_{rs-1}) = x p_{rs} + p_{rs-1},$$
and substituting $a_1 + \sqrt{n}$ for x, we get
$$\sqrt{n}\,((a_1 + \sqrt{n}) q_{rs} + q_{rs-1}) = (a_1 + \sqrt{n}) p_{rs} + p_{rs-1}.$$
This simplifies to
$$\sqrt{n}\,(a_1 q_{rs} + q_{rs-1} - p_{rs}) = a_1 p_{rs} + p_{rs-1} - n q_{rs}.$$

The right-hand side of this equation is an integer while the left-hand side is
$\sqrt{n}$ times an integer. Since $\sqrt{n}$ is irrational, the only way that equality can
occur is when both sides are equal to zero. Hence we have the two equations
$$a_1 q_{rs} + q_{rs-1} = p_{rs}$$
and
$$a_1 p_{rs} + p_{rs-1} = n q_{rs}.$$
Multiplying the first of these equations by $p_{rs}$ and the second by $q_{rs}$ and
then subtracting gives
$$p_{rs}^2 - n q_{rs}^2 = p_{rs} q_{rs-1} - p_{rs-1} q_{rs},$$
the right-hand side of which is equal to $(-1)^{rs}$ by Theorem 1.3 property (a)
of Unit 7. ∎


When the cycle length s of the continued fraction of $\sqrt{n}$ is even we have
$(-1)^{rs} = 1$ and Theorem 1.2 tells us that $x = p_{rs}$, $y = q_{rs}$ is a solution of the
Pell’s equation for each $r \ge 1$. On the other hand, when s is odd, $x = p_{rs}$,
$y = q_{rs}$ is a solution of $x^2 - ny^2 = -1$ when r is odd, and is a solution of
$x^2 - ny^2 = 1$ when r is even.


Problem 1.2 

Given that $\sqrt{11}$ = [3, (3, 6)], find the three smallest positive solutions of
$x^2 - 11y^2 = 1$.

Problem 1.3

Given that $\sqrt{13}$ = [3, (1, 1, 1, 1, 6)], find one positive solution of each of the
equations
$$x^2 - 13y^2 = 1 \quad\text{and}\quad x^2 - 13y^2 = -1.$$


To complete our survey of Pell’s equation, we shall demonstrate an 
alternative way of finding all the solutions. 


1.3 Solutions from the fundamental solution 


We shall refer to the solution of $x^2 - ny^2 = 1$ in which x and y take their
least positive values as the fundamental solution. From Theorem 1.2 we
know that, if the ICF of $\sqrt{n}$ has cycle length s then the fundamental
solution is given by $x = p_s$, $y = q_s$ when s is even, and by $x = p_{2s}$, $y = q_{2s}$
when s is odd.

If we have two solutions, $x = x_1$, $y = y_1$ and $x = x_2$, $y = y_2$, then we
can readily deduce from the equation that $x_1 < x_2$ if, and only if, $y_1 < y_2$. So the
solution with the smallest x value also has the smallest y value.

There is a simple algorithm for constructing all solutions from the
fundamental one.


Theorem 1.3 All solutions from a fundamental solution 


Let $x = x_1$, $y = y_1$ be the fundamental solution of $x^2 - ny^2 = 1$. Then,
for each integer $k \ge 1$, $x = x_k$, $y = y_k$ is also a solution, where the
positive integers $x_k$ and $y_k$ are given by
$$x_k + y_k\sqrt{n} = (x_1 + y_1\sqrt{n})^k.$$


Conversely, the solutions given in this way are the only positive 
solutions. 


Before embarking on the proof let us look at an example to be sure we
understand the process implied in the statement of the theorem. 


Example 1.2 
Find four solutions of $x^2 - 8y^2 = 1$.

Since $\sqrt{8}$ = [2, (1, 4)], which has a cycle of length 2, the second convergent
$C_2 = 3/1$ gives the fundamental solution $x_1 = 3$, $y_1 = 1$. Note that $3^2 - 8 \times 1^2 = 1$.

Further solutions are found as follows.

$(3 + 1 \times \sqrt{8})^2 = 17 + 6\sqrt{8}$,
so $x_2 = 17$, $y_2 = 6$ is a solution.

$(3 + 1 \times \sqrt{8})^3 = (17 + 6\sqrt{8})(3 + \sqrt{8}) = 99 + 35\sqrt{8}$,
so $x_3 = 99$, $y_3 = 35$ is a solution.

$(3 + 1 \times \sqrt{8})^4 = (17 + 6\sqrt{8})(17 + 6\sqrt{8}) = 577 + 204\sqrt{8}$,
so $x_4 = 577$, $y_4 = 204$ is a solution.
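
The process in Example 1.2 never requires $\sqrt{n}$ to be evaluated: multiplying $x_k + y_k\sqrt{n}$ by $x_1 + y_1\sqrt{n}$ and collecting terms gives $x_{k+1} = x_1 x_k + n y_1 y_k$ and $y_{k+1} = x_1 y_k + y_1 x_k$. The Python sketch below is ours, with the hypothetical function name `pell_solutions`; it applies this recurrence to generate solutions from a fundamental solution that is assumed to be already known.

```python
def pell_solutions(n, x1, y1, count):
    """First `count` pairs (x_k, y_k) with x_k + y_k*sqrt(n) = (x1 + y1*sqrt(n))^k.

    (x1, y1) is assumed to be the fundamental solution of x^2 - n*y^2 = 1.
    """
    solutions = []
    x, y = x1, y1
    for _ in range(count):
        solutions.append((x, y))
        # Multiply (x + y*sqrt(n)) by (x1 + y1*sqrt(n)) and collect the
        # rational part and the coefficient of sqrt(n).
        x, y = x * x1 + n * y * y1, x * y1 + y * x1
    return solutions


for x, y in pell_solutions(8, 3, 1, 4):
    print(x, y, x * x - 8 * y * y)
```

Working with integer pairs in this way avoids floating-point arithmetic altogether, so the solutions remain exact however large they become.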


Our proof of Theorem 1.3 is not particularly difficult but does involve a 
good deal of careful algebra. You will not be expected to reproduce this 
proof and can omit it if you so wish. 


Proof of Theorem 1.3 


The proof that each $x = x_k$, $y = y_k$ is a solution follows quickly from the
observation that
$$(x_1 + y_1\sqrt{n})^k = x_k + y_k\sqrt{n} \iff (x_1 - y_1\sqrt{n})^k = x_k - y_k\sqrt{n}.$$

To see why this is so, imagine the binomial expansions:
$$(x_1 + y_1\sqrt{n})^k = x_1^k + {}^kC_1\, x_1^{k-1} y_1\sqrt{n} + {}^kC_2\, x_1^{k-2} (y_1\sqrt{n})^2 + {}^kC_3\, x_1^{k-3} (y_1\sqrt{n})^3 + \cdots$$
and
$$(x_1 - y_1\sqrt{n})^k = x_1^k - {}^kC_1\, x_1^{k-1} y_1\sqrt{n} + {}^kC_2\, x_1^{k-2} (y_1\sqrt{n})^2 - {}^kC_3\, x_1^{k-3} (y_1\sqrt{n})^3 + \cdots$$
We wish to collate the terms which do not involve $\sqrt{n}$ and those which do.
The terms which do not involve $\sqrt{n}$ are those where $\sqrt{n}$ is raised to an even
power. These are identical in the two expansions, and so if they sum to $x_k$
in one they do likewise in the other. The terms in which $\sqrt{n}$ is raised to an
odd power are the same in the two expansions except, in the second, every
sign has been changed. So in the first expansion they sum to $y_k\sqrt{n}$ and in
the second to $-y_k\sqrt{n}$.

Therefore,
$$x_k^2 - n y_k^2 = (x_k + y_k\sqrt{n})(x_k - y_k\sqrt{n}) = (x_1 + y_1\sqrt{n})^k (x_1 - y_1\sqrt{n})^k = (x_1^2 - n y_1^2)^k = 1^k = 1.$$
It remains to show that the solutions $x = x_k$, $y = y_k$ obtained in this way
are the only solutions. In the hope of reaching a contradiction suppose that
x = a, y = b is some other positive solution. Then, since $x = x_1$, $y = y_1$ is
the smallest positive solution and $x_1 + y_1\sqrt{n} > 1$, there exists a positive
integer k such that
$$(x_1 + y_1\sqrt{n})^k < a + b\sqrt{n} < (x_1 + y_1\sqrt{n})^{k+1}.$$

We now wish to multiply through this inequality by $(x_1 - y_1\sqrt{n})^k$. Notice
that
$$x_1 - y_1\sqrt{n} = \frac{1}{x_1 + y_1\sqrt{n}} > 0,$$
and so multiplication by the positive amount $(x_1 - y_1\sqrt{n})^k$ will preserve
order. Moreover, we have seen that this multiplier is equal to $x_k - y_k\sqrt{n}$,
and so
$$(x_1 + y_1\sqrt{n})^k (x_1 - y_1\sqrt{n})^k < (a + b\sqrt{n})(x_k - y_k\sqrt{n}) < (x_1 + y_1\sqrt{n})^{k+1} (x_1 - y_1\sqrt{n})^k.$$

This expression simplifies to
$$1 < (a + b\sqrt{n})(x_k - y_k\sqrt{n}) < x_1 + y_1\sqrt{n}.$$

Now $(a + b\sqrt{n})(x_k - y_k\sqrt{n}) = c + d\sqrt{n}$, where $c = a x_k - n b y_k$ and
$d = b x_k - a y_k$. A little algebraic manipulation confirms that
$$c^2 - n d^2 = (x_k^2 - n y_k^2)(a^2 - n b^2) = 1.$$

Therefore x = c, y = d is a solution of the equation and the established
inequality, which can now be written as
$$1 < c + d\sqrt{n} < x_1 + y_1\sqrt{n},$$
will contradict the fact that $x = x_1$, $y = y_1$ is the fundamental solution, provided
we can confirm that c and d are both positive. For this result, note that
since $(c + d\sqrt{n})(c - d\sqrt{n}) = 1$ and $c + d\sqrt{n} > 1$, we have $0 < c - d\sqrt{n} < 1$.
Therefore
$$2c = (c + d\sqrt{n}) + (c - d\sqrt{n}) > 1 + 0, \quad\text{giving } c > 0,$$
and
$$2d\sqrt{n} = (c + d\sqrt{n}) - (c - d\sqrt{n}) > 1 - 1, \quad\text{giving } d > 0.$$
So the proof is complete. ∎

Although we have concentrated on the equation $x^2 - ny^2 = 1$ we could
apply our line of proof in Theorem 1.3 to the allied equation $x^2 - ny^2 = -1$.
This equation will not have any solutions unless the cycle in the ICF of $\sqrt{n}$
has odd length. For those n which do admit solutions we have the following 
result. Its proof, which we shall not spell out here, involves no more than 
retracing the steps of the above proof and changing signs where appropriate. 


Theorem 1.4 


Suppose that the ICF of $\sqrt{n}$ has a cycle of odd length s. Let $x_1 = p_s$,
$y_1 = q_s$, where the convergent $C_s = p_s/q_s$, and let $x_k$ and $y_k$ be given by
$$x_k + y_k\sqrt{n} = (x_1 + y_1\sqrt{n})^k.$$

Then, for all $k \ge 1$, $x = x_{2k-1}$, $y = y_{2k-1}$ is a solution of
$x^2 - ny^2 = -1$ and $x = x_{2k}$, $y = y_{2k}$ is a solution of $x^2 - ny^2 = 1$.

Moreover, all solutions of $x^2 - ny^2 = \pm 1$ are given in this way.
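
The alternation of signs described in Theorem 1.4 can be watched in a case already met: $\sqrt{29}$ has odd cycle length 5 and $C_5 = 70/13$. The Python sketch below is ours (the function name `powers` is hypothetical) and takes $x_1 = 70$, $y_1 = 13$ as its starting point.

```python
def powers(n, x1, y1, count):
    """Pairs (x_k, y_k), k = 1..count, with x_k + y_k*sqrt(n) = (x1 + y1*sqrt(n))^k."""
    out = []
    x, y = x1, y1
    for _ in range(count):
        out.append((x, y))
        # Integer form of multiplication by x1 + y1*sqrt(n).
        x, y = x * x1 + n * y * y1, x * y1 + y * x1
    return out


n, x1, y1 = 29, 70, 13          # 70^2 - 29 * 13^2 = -1
for k, (x, y) in enumerate(powers(n, x1, y1, 4), start=1):
    print(k, x, y, x * x - n * y * y)
```

The odd powers give solutions of $x^2 - 29y^2 = -1$ and the even powers give solutions of $x^2 - 29y^2 = 1$, the second pair being (9801, 1820).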


Problem 1.4 


Given that $\sqrt{17}$ = [4, (8)] find the smallest solution of $x^2 - 17y^2 = -1$ and
hence find two positive solutions of $x^2 - 17y^2 = 1$.


Problem 1.5 

For a number to be both triangular and square there must exist integers m
and n such that $\frac{n(n+1)}{2} = m^2$. Substituting $n = \frac{x-1}{2}$ and $m = \frac{y}{2}$ this
equation becomes $x^2 - 2y^2 = 1$. Use the solutions of this Pell’s equation to
find five triangular squares.

Note that $\sqrt{2}$ = [1, (2)].

2 THE PYTHAGOREAN EQUATION


2.1 Primitive Pythagorean triples 


The theorem of Pythagoras, which states that the sum of the squares on the
two shorter sides of a right-angled triangle is equal to the square on the
hypotenuse, is arguably the most celebrated theorem in mathematics. But
in addition to being a classical theorem of geometry, it presents challenges
for the number theorist when the underlying equation $x^2 + y^2 = z^2$ is
considered as a Diophantine equation. We refer to this as the Pythagorean
equation. Interest focuses on right-angled triangles all three of whose sides
are positive integers. The Pythagorean equation $x^2 + y^2 = z^2$ certainly has
solutions in positive integers, including the well-known one which has the
smallest value of z, namely $3^2 + 4^2 = 5^2$. It is also immediate from this one
solution that the equation has infinitely many solutions since, for any
positive integer k, $(3k)^2 + (4k)^2 = (5k)^2$. But there is a sense in which this
infinite family of solutions is really just one solution. Geometrically, the
corresponding triangles are similar; we have the one basic triangle with sides
3, 4 and 5 and the others are obtained by scaling each side by factor k.

We shall continue the geometrical analogy by referring to the value of z in
any solution as being the hypotenuse.


Figure 2.1 Pythagoras’ Theorem: $x^2 + y^2 = z^2$


Before progressing further let us introduce some terminology. 


Definition 2.1 Pythagorean triples 


A Pythagorean triple is a triple (x, y, z) of positive integers such that
$x^2 + y^2 = z^2$. The triple (x, y, z) is said to be primitive if gcd(x, y) = 1.

A triple is ordered so that the hypotenuse is always the third member.


Notice that the condition gcd(x, y) = 1 guarantees that x and y have no
common divisor greater than 1, and consequently the primitive triple
(x, y, z) cannot be a scaling of some smaller triple. Moreover, the condition
gcd(x, y) = 1 carries the implications that gcd(x, z) = gcd(y, z) = 1, since
from the equation $x^2 + y^2 = z^2$ it is readily shown that any common divisor
of two of the variables must divide the third.
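
Definition 2.1 translates directly into a test that can be run on any candidate triple. The Python sketch below is ours (the function names `is_pythagorean_triple` and `is_primitive` are hypothetical) and also illustrates, on a few examples, the remark that gcd(x, y) = 1 forces gcd(x, z) = gcd(y, z) = 1.

```python
from math import gcd


def is_pythagorean_triple(x, y, z):
    """True if x, y, z are positive integers with x^2 + y^2 = z^2."""
    return min(x, y, z) > 0 and x * x + y * y == z * z


def is_primitive(x, y, z):
    """True if (x, y, z) is a Pythagorean triple with gcd(x, y) = 1."""
    return is_pythagorean_triple(x, y, z) and gcd(x, y) == 1


for triple in [(3, 4, 5), (15, 20, 25), (2, 3, 4), (20, 21, 29)]:
    x, y, z = triple
    print(triple, is_pythagorean_triple(x, y, z), is_primitive(x, y, z),
          gcd(x, z), gcd(y, z))
```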
Problem 2.1

Which, if any, of the following are Pythagorean triples? For each
Pythagorean triple, decide whether or not it is primitive.

(a) (10, 8, 6)   (b) (12, 9, 15)   (c) (6, 7, 8)   (d) (5, 12, 13)
(e) (24, 33, 41)


We now have two primitive Pythagorean triples, namely (3, 4,5) and 
(5, 12,13). Are there any more? Table 2.1 displays the start of a curious 
infinite family of primitive solutions. 


Table 2.1 A family of primitive Pythagorean triples 


x            y                      z
21           220                    221
201          20 200                 20 201
2001         2 002 000              2 002 001
20 001       200 020 000            200 020 001
200 001      20 000 200 000         20 000 200 001
2 000 001    2 000 002 000 000      2 000 002 000 001
20 000 001   200 000 020 000 000    200 000 020 000 001


The Babylonians, some 3500 years ago, were aware of many primitive 
solutions and Pythagoras himself is credited with the first infinite family of 
solutions given by 


$x = 2k + 1$, $y = 2k^2 + 2k$, $z = 2k^2 + 2k + 1$, for any integer $k \ge 1$.


(The solutions displayed in Table 2.1 are taken from this family by 
choosing k to be 10, 100, 1000, ....) But this infinite family does not 
exhaust all primitive solutions; for example it does not include the solution 
(8, 15,17). The first complete solution of the Pythagorean equation 
appeared in Euclid’s Elements. We aim to reproduce that solution here, but 
first there are a few points to be clarified. 
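
The family attributed to Pythagoras is easily generated and checked. The Python sketch below is ours (`pythagoras_family` is a hypothetical name); it evaluates the three formulae for a given k and confirms that the result is a primitive Pythagorean triple.

```python
from math import gcd


def pythagoras_family(k):
    """The triple (2k + 1, 2k^2 + 2k, 2k^2 + 2k + 1) for an integer k >= 1."""
    return 2 * k + 1, 2 * k * k + 2 * k, 2 * k * k + 2 * k + 1


for k in (1, 2, 3, 10, 100):
    x, y, z = pythagoras_family(k)
    print(k, (x, y, z), x * x + y * y == z * z, gcd(x, y) == 1)
```

Every member of the family has z = y + 1, which is why a triple such as (8, 15, 17), whose hypotenuse exceeds each of the other sides by more than 1, cannot belong to it.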


If (x, y, z) is any Pythagorean triple then x and y cannot both be odd for if
they were then
$$z^2 = x^2 + y^2 \equiv 1 + 1 = 2 \pmod{4},$$
which is impossible as all squares are congruent modulo 4 to either 0 or 1. It
follows that in a primitive triple x and y have opposite parity because the
further condition that gcd(x, y) = 1 ensures that they cannot both be even.

If integers x and y are both odd or both even they are said to have the
same parity; otherwise they are said to have opposite parity.

If (x, y, z) is a primitive Pythagorean triple then so too is (y, x, z) because
the x and y values can certainly be interchanged. But, to all intents and
purposes, the triples (x, y, z) and (y, x, z) lead to what is really the same
solution of the Pythagorean equation. To overcome the need to distinguish
between these two equivalent solutions we shall choose to write the even
member first in any primitive triple. We adopt the following convention.


Convention for Pythagorean triples 


If (x,y,z) is any primitive Pythagorean triple, x is even while y and z 
are both odd. 
With the preparation now complete we are ready for the main theorem of
this section. 


Theorem 2.1 Primitive solutions of the Pythagorean equation 


The primitive Pythagorean triples are the triples
$$(2mn,\; m^2 - n^2,\; m^2 + n^2),$$


where m and n are relatively prime positive integers of opposite parity 
and with m > n. 


Before embarking on the proof let us clarify the claim of the theorem by 
looking at an example of its use. 


Example 2.1 
Find all primitive Pythagorean triples (60, y, z). 
We seek relatively prime integers m and n such that 2mn = 60, m and n 
have opposite parity and m > n. There are four pairs of positive integers 
satisfying mn = 30 with m > n and in all four cases m and n are relatively 
prime and have opposite parity: 

m = 30, n = 1 giving (60, 899, 901);

m = 15, n = 2 giving (60, 221, 229);

m = 10, n = 3 giving (60, 91, 109);

m = 6, n = 5 giving (60, 11, 61).

These four are the only primitive Pythagorean triples (60, y, z).
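
The search carried out in Example 2.1 amounts to running through the factorizations of x/2 as mn and keeping those that meet the conditions of Theorem 2.1. The Python sketch below is ours (`primitive_triples_with_even_member` is a hypothetical name) and follows the convention that the even member is written first.

```python
from math import gcd, isqrt


def primitive_triples_with_even_member(x):
    """All primitive Pythagorean triples (x, y, z) with the given even first member.

    Uses x = 2mn with m > n, gcd(m, n) = 1 and m, n of opposite parity.
    """
    if x % 2 != 0:
        return []
    half = x // 2
    triples = []
    for n in range(1, isqrt(half) + 1):
        if half % n == 0:
            m = half // n
            if m > n and gcd(m, n) == 1 and (m - n) % 2 == 1:
                triples.append((x, m * m - n * n, m * m + n * n))
    return triples


print(primitive_triples_with_even_member(60))
print(primitive_triples_with_even_member(6))    # no triples: 6 is not a multiple of 4
```

For x = 60 the function returns the four triples found in the example, in the same order.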


Proof of Theorem 2.1 


We must first show that the given triple is a primitive Pythagorean triple. 
Substituting $x = 2mn$, $y = m^2 - n^2$ and $z = m^2 + n^2$ in the Pythagorean
equation we have
$$x^2 + y^2 = (2mn)^2 + (m^2 - n^2)^2 = m^4 + 2m^2n^2 + n^4 = (m^2 + n^2)^2 = z^2.$$

So this is indeed a Pythagorean triple. To establish the primitive property
suppose to the contrary that p is a prime divisor of both $m^2 - n^2$ and
$m^2 + n^2$. Then p divides both the sum $(m^2 + n^2) + (m^2 - n^2) = 2m^2$ and
the difference $(m^2 + n^2) - (m^2 - n^2) = 2n^2$. But $p \ne 2$ (since y is odd) and
we conclude that p divides m and p divides n. As m and n are given to be
relatively prime we have the required contradiction. Hence gcd(y, z) = 1 and,
since any common divisor of x and y would also divide z, the triple is
primitive.


Conversely we have to show that every primitive Pythagorean triple is of the
stated form. With this goal in mind suppose that (x, y, z) is any primitive
Pythagorean triple. As y and z are both odd, we can define integers
$$s = \frac{z + y}{2} \quad\text{and}\quad t = \frac{z - y}{2}.$$

We note that s and t are relatively prime because any common divisor
would also divide $s + t = z$ and $s - t = y$.
Then from the Pythagorean equation
$$x^2 = z^2 - y^2 = (z + y)(z - y) = 2s \times 2t$$
we obtain
$$\left(\frac{x}{2}\right)^2 = st.$$

Observe that the even value, x = 2mn, is always a multiple of 4.


As x is known to be even this equation confirms that the product st is a
perfect square. Therefore, since s and t are relatively prime, they must each
be squares. (This follows from the work of Unit 2, but see also Problem 2.4 at
the end of this subsection.) So if we now write $s = m^2$ and $t = n^2$ and
substitute back:

$x^2 = 4st = 4m^2n^2$, giving $x = 2mn$;

$y = s - t = m^2 - n^2$;

$z = s + t = m^2 + n^2$.

Finally, note that gcd(m, n) = 1, because any common divisor of m and n
would necessarily be a common divisor of the relatively prime integers
$s = m^2$ and $t = n^2$. Moreover m and n must have opposite parity for
otherwise y and z would be even.

The triple (x, y, z) has thus been expressed in the required form. ∎


All Pythagorean triples can be obtained by scaling primitive ones; that is, 
any Pythagorean triple is of the form (kx, ky, kz), where (x, y, z) is a
primitive triple. Notice that our convention for primitive triples carries over 
to all triples. For example, consider the triples (6,8, 10) and (8,6,10) which 
we wish to regard as being the same. Since the underlying primitive triple is 
(4, 3,5) we shall conventionally write this multiple as (8,6, 10), rather than 
(6,8, 10). In other words, the first member in the triple is the multiple of the 
even member in the underlying primitive triple. With this clarified we can 
now record the following Corollary. 


Corollary All solutions of the Pythagorean equation

The Pythagorean triples $(x, y, z)$ are given by

\[
x = 2kmn, \quad y = k(m^2 - n^2), \quad z = k(m^2 + n^2),
\]


where $k \geq 1$ is any integer and $m$ and $n$ are relatively prime positive
integers with opposite parity and $m > n$.

In the next example we make use of Theorem 2.1 to begin an enumeration of
all the primitive Pythagorean triples.

Example 2.2

There are 16 primitive Pythagorean triples $(x, y, z)$ with hypotenuse
$z < 100$. Find them.

We find primitive triples by listing pairs of relatively prime integers $m$ and $n$
which have opposite parity and with $m > n$. For each such pair the
corresponding triple is calculated from the formulae in Theorem 2.1. For
example, taking $m = 7$ each of the values $n = 2$, 4 and 6 will lead to a
primitive triple. (For $m = 7$ and $n = 1$, 3 or 5 the resulting triple will be
Pythagorean but will not be primitive because $m$ and $n$ have the same parity.)

The condition that $z < 100$ amounts to $m^2 + n^2 < 100$. Hence $m \leq 9$ and,
for example, the triple resulting from $m = 8$, $n = 7$ is not wanted because its
hypotenuse exceeds 100. The 16 solutions are as follows.


Table 2.2 Primitive triples with sides not exceeding 100


m  n  triple            m  n  triple

2  1  (4, 3, 5)         7  2  (28, 45, 53)
3  2  (12, 5, 13)       7  4  (56, 33, 65)
4  1  (8, 15, 17)       7  6  (84, 13, 85)
4  3  (24, 7, 25)       8  1  (16, 63, 65)
5  2  (20, 21, 29)      8  3  (48, 55, 73)
5  4  (40, 9, 41)       8  5  (80, 39, 89)
6  1  (12, 35, 37)      9  2  (36, 77, 85)
6  5  (60, 11, 61)      9  4  (72, 65, 97)
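
The enumeration in Example 2.2 can also be carried out mechanically. The following Python sketch is one possible way of doing so, assuming only the formulae of Theorem 2.1; the names `primitive_triples` and `z_limit` are illustrative choices and do not come from the course text.

```python
from math import gcd


def primitive_triples(z_limit):
    """List (m, n, (x, y, z)) for each primitive triple with z < z_limit."""
    found = []
    m = 2
    # For a given m the smallest hypotenuse is m^2 + 1 (taking n = 1).
    while m * m + 1 < z_limit:
        for n in range(1, m):
            # Theorem 2.1: m > n, opposite parity, relatively prime.
            if (m - n) % 2 == 1 and gcd(m, n) == 1:
                x, y, z = 2 * m * n, m * m - n * n, m * m + n * n
                if z < z_limit:
                    found.append((m, n, (x, y, z)))
        m += 1
    return found


triples = primitive_triples(100)
for m, n, triple in triples:
    print(m, n, triple)
print(len(triples), "primitive triples found")
```

With `z_limit = 100` the loop examines $m = 2, \ldots, 9$ and reproduces the sixteen rows of Table 2.2.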

Problem 2.2

Find all primitive Pythagorean triples (72, y, z). 


Problem 2.3 


Find all Pythagorean triples, primitive or not, in which one of the sides is 30. 


Problem 2.4 


Give a proof of the step assumed in Theorem 2.1 that, if a product of two 
relatively prime integers is a square, then each of the integers must be 
square. 


2.2 Special Pythagorean triples 


From the classification of Theorem 2.1 it is not difficult to see exactly which
integers can arise as members of a primitive Pythagorean triple. If we
choose $m$ and $n$ to be consecutive integers $m = k + 1$, $n = k$ (for any $k \geq 1$)
then they are relatively prime and of opposite parity and so give the
primitive triple $(2k^2 + 2k,\ 2k + 1,\ 2k^2 + 2k + 1)$. (This is the family
discovered by Pythagoras which we mentioned earlier, but with the $x$ and $y$
values interchanged.) The middle value, $2k + 1$,
shows that every odd integer from 3 onwards can be a member of a primitive
Pythagorean triple. As far as even integers are concerned, the even member
is $x = 2mn$, where $m$ and $n$ have opposite parity, and consequently $x$ is a
multiple of 4. This means that no integer which is congruent modulo 4 to 2
can be a member of a primitive Pythagorean triple. However the choice
$m = 2k$ and $n = 1$ gives the triple $(4k,\ 4k^2 - 1,\ 4k^2 + 1)$ and proves that
every multiple of 4 occurs as a member of a primitive Pythagorean triple. So
every integer $n > 1$, with the exception of those $n \equiv 2 \pmod 4$, occurs in
some primitive Pythagorean triple.
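
The two families used in this argument can be turned into a small lookup. The Python sketch below is illustrative only; the function name `primitive_triple_containing` is an assumption of this example. It returns `None` for integers congruent to 2 modulo 4 because the argument above shows that none exists, not because a search has been made.

```python
from math import gcd


def primitive_triple_containing(n):
    """Return a primitive triple with n as a member, following the two
    families in the text, or None when n < 3 or n is congruent to 2 mod 4."""
    if n >= 3 and n % 2 == 1:
        k = (n - 1) // 2                  # n = 2k + 1, from m = k + 1, n = k
        return (2 * k * k + 2 * k, 2 * k + 1, 2 * k * k + 2 * k + 1)
    if n >= 4 and n % 4 == 0:
        k = n // 4                        # n = 4k, from m = 2k, n = 1
        return (4 * k, 4 * k * k - 1, 4 * k * k + 1)
    return None


for n in range(2, 13):
    print(n, primitive_triple_containing(n))
```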


What about occurrences of primes? Not all three members of a Pythagorean 
triple can be prime because one member is a multiple of 4. On the other 
hand, Table 2.2 reveals several instances where both the odd members of a 
primitive triple are prime. Suppose that the y-value in a triple is prime, say 
$y = p$. Then, from $x^2 + p^2 = z^2$ we have

\[
p^2 = z^2 - x^2 = (z + x)(z - x).
\]

As $z > x > 0$, the two terms on the right of this equation are distinct and
positive. Now there is only one way that $p^2$ can be written as the product of
unequal positive divisors, namely $p^2 = p^2 \times 1$. Hence

\[
z + x = p^2 \quad\text{and}\quad z - x = 1,
\]

and solving these equations for $x$ and $z$ gives the primitive triple

\[
\left( \frac{p^2 - 1}{2},\ p,\ \frac{p^2 + 1}{2} \right).
\]

(As $p^2 \equiv 1 \pmod 8$ we have confirmation that $\frac{p^2 - 1}{2}$ is
divisible by 4.)

So a Pythagorean triple contains two prime members whenever the prime $p$
is such that $\frac{p^2 + 1}{2}$ is also prime. Many examples of such pairs of primes are


known, but it is one more of the many unsolved problems of number theory 
as to whether or not there are infinitely many such related pairs of primes. 
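
A few of these pairs of primes can be found by direct search. The following Python sketch is only an illustration, using trial division; the names `is_prime` and `two_prime_triples` are assumptions of this example.

```python
def is_prime(n):
    """Trial division; adequate for the small numbers used here."""
    if n < 2:
        return False
    d = 2
    while d * d <= n:
        if n % d == 0:
            return False
        d += 1
    return True


def two_prime_triples(limit):
    """Triples ((p^2 - 1)/2, p, (p^2 + 1)/2) with p an odd prime below limit
    for which (p^2 + 1)/2 is also prime."""
    result = []
    for p in range(3, limit, 2):
        z = (p * p + 1) // 2
        if is_prime(p) and is_prime(z):
            result.append(((p * p - 1) // 2, p, z))
    return result


print(two_prime_triples(50))
```

For $p < 50$ the search returns the triples with $p = 3$, 5, 11, 19 and 29; the first three of these appear in Table 2.2.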


To round off this section let us look at one more associated problem. The 
triple (4,3,5) has the property that its three sides are consecutive integers. 
It is not difficult to show that no other triple can have this property. There 
are certainly instances where two of the members are consecutive. As we 
discovered above, each triple in the infinite family of primitive triples of the
form $(2k^2 + 2k,\ 2k + 1,\ 2k^2 + 2k + 1)$ has the even member and the
hypotenuse being consecutive integers. The remaining question concerns
the possibility of the two shorter sides being consecutive. Table 2.2 
uncovered two triples with this property, namely (4,3,5) and (20, 21, 29). 
Are there any more?

Example 2.3

Find primitive triples $(x, y, z)$ in which $x$ and $y$ are consecutive integers.

Adopting the notation of Theorem 2.1, suppose that $x = 2mn$ and
$y = m^2 - n^2$ are consecutive integers. That is,

\[
m^2 - n^2 = 2mn \pm 1.
\]

Taking all terms involving $m$ to the left-hand side and completing the square
gives

\[
(m - n)^2 = 2n^2 \pm 1.
\]

Now the substitution $r = m - n$ yields

\[
r^2 - 2n^2 = \pm 1.
\]

This should look familiar from the work of the previous section. All solutions
in $r$ and $n$ arise from convergents in the ICF of $\sqrt{2} = [1, \langle 2 \rangle]$ which begin

\[
\frac{1}{1},\ \frac{3}{2},\ \frac{7}{5},\ \frac{17}{12},\ \frac{41}{29},\ \ldots
\]

As the cycle of $\sqrt{2}$ has length 1, each convergent $\frac{p_k}{q_k}$ gives a solution
$r = p_k$, $n = q_k$ of one of the equations $r^2 - 2n^2 = \pm 1$ (Theorem 1.2), and the
required value of $m$ is recovered by means of $m = r + n$. The first five
convergents lead to triples as follows.

 r    n    m      x      y      z

 1    1    2      4      3      5
 3    2    5     20     21     29
 7    5   12    120    119    169
17   12   29    696    697    985
41   29   70   4060   4059   5741


Diversion

As a diversion we extend the table begun in Example 2.3 by listing the first twenty
Pythagorean triples with consecutive shorter sides.

                x                 y                 z

                4                 3                 5
               20                21                29
              120               119               169
              696               697               985
             4060              4059              5741
            23660             23661             33461
           137904            137903            195025
           803760            803761           1136689
          4684660           4684659           6625109
         27304196          27304197          38613965
        159140520         159140519         225058681
        927538920         927538921        1311738121
       5406093004        5406093003        7645370045
      31509019100       31509019101       44560482149
     183648021600      183648021599      259717522849
    1070379110496     1070379110497     1513744654945
    6238626641380     6238626641379     8822750406821
   36361380737780    36361380737781    51422757785981
  211929657785304   211929657785303   299713796309065
 1235216565974040  1235216565974041  1746860020068409
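
The table above can be regenerated from the convergents of $\sqrt{2}$. The Python sketch below assumes the standard fact, not stated in the text, that the convergent following $p/q$ is $(p + 2q)/(p + q)$; the function name `consecutive_leg_triples` is illustrative.

```python
def consecutive_leg_triples(count):
    """Triples (x, y, z) with |x - y| = 1, built as in Example 2.3."""
    result = []
    r, n = 1, 1                        # first convergent 1/1 of sqrt(2)
    for _ in range(count):
        m = r + n                      # recover m from r = m - n
        result.append((2 * m * n, m * m - n * n, m * m + n * n))
        r, n = r + 2 * n, r + n        # next convergent (assumed recurrence)
    return result


triples = consecutive_leg_triples(20)
for x, y, z in triples:
    print(x, y, z)
```

Python integers have arbitrary precision, so the later rows are computed exactly.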


Problem 2.5 


Find all Pythagorean triples in which the three sides are in arithmetic 
progression. 


Problem 2.6 


Show that there are infinitely many Pythagorean triples in which the shorter
sides are consecutive triangular numbers $T_k$ and $T_{k+1}$. Hint: If $(m, m+1, n)$
is a Pythagorean triple try the triangular numbers $T_{2m}$ and $T_{2m+1}$.


Diversion 


It is possible to have Pythagorean triples all three of whose sides are
triangular numbers. For example, 


\[
T_{132}^2 + T_{143}^2 = T_{164}^2.
\]
It is not known whether infinitely many such triples exist. 


Is it possible to have a primitive Pythagorean triple all three of whose members 
are squares? 


3 FERMAT’S LAST THEOREM 


3.1 Some history of the Last Theorem 


Fermat’s refusal to publish his discoveries led to one of the great stories in 
the history of mathematics. As well as writing letters to his friends about 
various results, he formed the habit of jotting notes in the margins of his 
reference books. These notes to himself comprised brief summaries of his 
discoveries. Some five years after his death his copy of Bachet’s Diophantus 
was found to contain mention of many results from number theory. On the 
page dealing with the Pythagorean equation Fermat had added in the 
margin: 


On the other hand it is impossible to write a cube as the sum of two 
cubes, a fourth power as the sum of two fourth powers and, in general, 
any power higher than the second as the sum of two similar powers. I 
have discovered a truly marvellous proof of this, but the margin is too 
small to contain it. 


In this, Fermat was claiming to have proved that the Diophantine equation
$x^n + y^n = z^n$ has no positive solutions for $n \geq 3$. No evidence of how
Fermat reached his conclusions survives, and despite three and a half 
centuries of continual efforts to solve the problem, nobody managed to 
prove, or disprove, Fermat’s assertion. The difficulties encountered in 
attempting a general proof have convinced many mathematicians and 
historians that Fermat was mistaken and did not really have a proof. 
Measured against this is the fact that Fermat was a phenomenal 
mathematician and nothing which he ever claimed to have proved has 
subsequently been disproved. In the single instance in which he erroneously 
claimed something to be true, namely that the Fermat numbers are all 
prime, he left his contemporaries in no doubt that he was unable to prove it. 


As this was the last of Fermat’s results awaiting proof or refutation it has 
become known colloquially as Fermat’s Last Theorem. Strictly speaking, it 
should not be a theorem until a proof has been found, and some texts refer
to it as Fermat’s Conjecture. However, for good reasons which we shall give
shortly, we shall use the former, more notorious name. 


Consider the equation $x^n + y^n = z^n$, for $n \geq 3$, and suppose the exponent $n$
is composite, say $n = kr$. The equation can then be rewritten as

\[
(x^k)^r + (y^k)^r = (z^k)^r.
\]

It follows that if $x = x_0$, $y = y_0$, $z = z_0$ is a solution of $x^n + y^n = z^n$ then

\[
x = x_0^k, \quad y = y_0^k, \quad z = z_0^k
\]

is a solution of $x^r + y^r = z^r$. Consequently, if it can be established that
$x^r + y^r = z^r$ has no positive solution then it follows that $x^n + y^n = z^n$ must
have no positive solutions.


Now every integer $n \geq 3$ is divisible by some odd prime or by 4. The proof
of Fermat’s Last Theorem can therefore be broken down to two tasks. If
each of the following can be proved


• $x^4 + y^4 = z^4$ has no positive solution


• $x^p + y^p = z^p$ has no positive solution for any odd prime $p$

then it will follow that $x^n + y^n = z^n$ has no positive solution for any $n \geq 3$.
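
The reduction just described can be made concrete. The following Python sketch, whose function name `reduced_exponent` is an assumption of this example, returns for any $n \geq 3$ an exponent $r$ (an odd prime or 4) dividing $n$, so that a positive solution for exponent $n$ would yield one for exponent $r$.

```python
def reduced_exponent(n):
    """For n >= 3 return an odd prime divisor of n if one exists, else 4."""
    if n < 3:
        raise ValueError("n must be at least 3")
    odd_part = n
    while odd_part % 2 == 0:
        odd_part //= 2
    if odd_part == 1:
        # n is a power of 2 and n >= 4, so 4 divides n.
        return 4
    d = 3
    while d * d <= odd_part:
        if odd_part % d == 0:
            return d
        d += 2
    return odd_part                    # the odd part is itself prime


for n in (6, 7, 12, 16, 35):
    print(n, reduced_exponent(n))
```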


Fermat himself did leave us an elegant proof for the case of the exponent 4, 
which we shall present in this section. The real challenge, however, is in 
proving Fermat’s Last Theorem for odd prime exponents. In 1770 Euler 
resolved the case p = 3, although his proof was not quite complete. Half a 
century later Dirichlet and Legendre gave independent proofs for the case 

$p = 5$ and in 1835 Lamé supplied a proof for the case $p = 7$. But the proofs
of these cases adopted very different approaches and it was becoming clear 
that to reach a proof applicable for all odd primes p some new approach was 
needed. Lamé thought he had made the breakthrough in 1847. In
attempting to solve $x^p + y^p = z^p$, instead of working within the set of
integers Lamé extended his system by looking at numbers of the form
$m + n\alpha^r$, where $m$ and $n$ are integers, $\alpha^p = 1$ and $0 < r < p$. (These
numbers are called algebraic numbers and Lamé had invented the first
example of what algebraists today call a ring.) He thought he
had succeeded in proving that the equation had no non-trivial solution in
this number system, and therefore no positive integer solution. Alas, he
made one serious error. He assumed that numbers in his new system would
factorize uniquely. They do not, for the equivalent of the Fundamental
Theorem of Arithmetic, which leads to unique factorization in the integers,
does not hold in this ring of numbers.


The German mathematician Kummer, who may have been simultaneously
pursuing similar lines to Lamé, did achieve a significant advance. To
overcome the lack of unique factorization his remedy was to extend his
number system in a different way inventing the so-called ideal prime
divisors. Kummer successfully proved Fermat’s Last Theorem for the case of
exponents which he classed as regular primes. This was no mean
achievement for the regular primes include all primes less than 100 with the
three exceptions of 37, 59 and 67. Kummer believed that he had proved
Fermat’s Last Theorem for infinitely many primes but it remains another
unsolved problem of mathematics as to whether or not there are infinitely
many of Kummer’s regular primes; ironically, however, it has been known
since 1915 that there are infinitely many irregular ones.


By developing Kummer’s ideas, mathematicians have settled more and more
cases of Fermat’s Last Theorem over the years. The arrival of electronic
calculators inevitably speeded up discoveries and, for example, by 1980
Fermat’s Last Theorem had been shown to hold true for all odd prime
exponents not exceeding 125000. By 1994 that had been advanced to all
prime exponents not exceeding 4000000.


One significant breakthrough was achieved in 1983 by a young German
mathematician, Gerd Faltings. He proved that for any given $n$ there is at
most a finite number of solutions of $x^n + y^n = z^n$. All in all, the
overwhelming numerical evidence left little doubt that the result asserted in
Fermat’s Last Theorem must be true, but that elusive complete proof was
still missing.


Over the years a number of prizes have been offered for the first general 
proof of Fermat’s Last Theorem. The Academy of Science at Paris offered 
prizes in 1823 and again in 1850, and the Academy of Brussels put up a 
prize in 1883. But most memorable, in 1908 an enormous prize of 100000 
marks was bequeathed to the Academy of Science at Göttingen for the first
complete solution of Fermat’s Last Theorem. The entry condition that the 
solution must be printed no doubt deterred many but, nevertheless, over 
1000 solutions were submitted during the next four years. Inevitably these 
submissions were either erroneous or incomplete. Landau, a distinguished 
German number theorist of that period, was burdened with the task of 
checking the submissions. It is reported that he had postcards printed which 
read “Dear Sir or Madam, Your attempted proof of Fermat’s Last Theorem 
has been received and is returned herewith. The first mistake is on page ..., 
line ... .” The job of filling in the missing entries was given to Landau’s 
research students! 


German inflation of the 1920’s reduced the value of the Göttingen prize to
virtually nothing, but the pursuit of a proof of Fermat’s Last Theorem has 
not waned. In 1988 the mathematical world was excited by the news that a 
Japanese mathematician, Yoichi Miyaoka, had resolved Fermat’s Last 
Theorem. Alas, as so often before, holes were found in the purported proof. 
Then in 1994 the distinguished British mathematician Andrew Wiles, now 
working at Princeton University, gave a series of lectures at Cambridge 
University which appeared to have culminated in a proof of Fermat’s Last 
Theorem. Although holes were initially found in this proof, it was believed 
that they could be plugged. The proof is of some 1000 pages and takes an 
incredibly circuitous route to the result, straying a long way outside
Elementary Number Theory. But, at the time of writing this course, the 
mathematical world is holding its breath in the growing belief that Wiles 
has, at long last, put proof of Fermat’s Last Theorem to rest. 


3.2 The equation $x^4 + y^4 = z^4$

In the remainder of this section we shall confine our attention to the special 
case of Fermat’s Last Theorem for exponent 4 and some related Diophantine 
equations. We shall use Fermat’s method and, just as he did, we shall prove 
the stronger result that $x^4 + y^4 = z^2$ has no positive solution. The proof
uses our solution to the Pythagorean equation together with a powerful 
technique known as Fermat’s method of infinite descent. 


The idea behind the method of infinite descent as applied to this problem is
as follows. We establish that from any positive solution $x = x_k$, $y = y_k$, $z = z_k$
of the Diophantine equation $x^4 + y^4 = z^2$ we can always find another
‘smaller’ positive solution $x = x_{k+1}$, $y = y_{k+1}$, $z = z_{k+1}$, where by smaller
we mean that $0 < z_{k+1} < z_k$. This process of being able to go from one
positive solution to a smaller one is the key stage of the method, and is
called the descent step.


Now suppose that the equation does have a positive solution $x = x_1$, $y = y_1$,
$z = z_1$. The established descent step would then tell us that, from this, we
have a smaller positive solution, and from this second solution a still smaller
third positive solution, from which we get a smaller fourth positive solution,
and so on. Focussing on the $z$-values we have an unending decreasing
sequence of positive integers

\[
z_1 > z_2 > z_3 > \cdots
\]

As there are only finitely many positive integers which are less than $z_1$ this
is plainly impossible. Hence the one assumption that we have made, namely
that one positive solution exists, is contradicted.


We have chosen to focus on the variable z here. But notice that descent 
could be achieved on any of the variables. For example, if we established 
that any positive solution of the equation must give rise to another positive 
solution with smaller value for y, then the conclusion that no solution can 
exist follows in the same way. 


What is really being brought into play here is the Well-Ordering Principle 
which asserts that any non-empty set of positive integers must have a least 
member. The descent step appears to construct a set of positive integers 
which does not have a least member, and so it has to be the empty set. 


Theorem 3.1 


The Diophantine equation $x^4 + y^4 = z^2$ has no positive solutions.


The proof of Theorem 3.1, and that of Theorem 3.2 which follows shortly, 
involve some tricky algebra. You are not expected to master the details of 
these proofs, but you should read each of them carefully to see how the 
method of infinite descent is used. 


Proof of Theorem 3.1 


Suppose to the contrary that $x^4 + y^4 = z^2$ has a positive solution $x = x_1$,
$y = y_1$, $z = z_1$. (Rather than use the general suffix $k$ as in the preamble, we
establish the descent step here by showing that any positive solution $x_1$, $y_1$, $z_1$
must lead to a positive solution $x_2$, $y_2$, $z_2$ in which $0 < z_2 < z_1$.)
If $\gcd(x_1, y_1) = d > 1$ then putting $x_1 = dx'$ and $y_1 = dy'$
gives $z_1^2 = d^4(x'^4 + y'^4)$, from which it follows that $d^2$ divides $z_1$ so that
$z_1 = d^2z'$, for some integer $z' < z_1$. But then $x'^4 + y'^4 = z'^2$, where
$\gcd(x', y') = 1$ and $z' < z_1$. This argument shows that we may assume that
$\gcd(x_1, y_1) = 1$ for otherwise any common divisor can first be cancelled
leaving another, smaller, solution to the same equation.


Writing the equation as $(x_1^2)^2 + (y_1^2)^2 = z_1^2$ we observe that $(x_1^2, y_1^2, z_1)$ is a
primitive Pythagorean triple and so, by Theorem 2.1,

\[
x_1^2 = 2mn, \quad y_1^2 = m^2 - n^2, \quad z_1 = m^2 + n^2,
\]

where $m$ and $n$ are relatively prime positive integers of opposite parity with
$m > n$. (If needed we may interchange $x_1$ and $y_1$ to ensure that $x_1$ is even.)
In fact $n$ must be even and $m$ odd, for otherwise we have

\[
y_1^2 = m^2 - n^2 \equiv 0 - 1 \equiv 3 \pmod 4,
\]

which is impossible because any square is congruent modulo 4 to either 0
or 1.


Putting $n = 2r$ the equation for $x_1$ becomes

\[
\left(\frac{x_1}{2}\right)^2 = mr,
\]

where $m$ and $r = \frac{n}{2}$ are relatively prime. Hence $m$ and $r$ are each squares,
say $m = s^2$ and $r = t^2$.


Returning to the equation $y_1^2 = m^2 - n^2$, we see that $(n, y_1, m)$ is a primitive
Pythagorean triple and so

\[
n = 2uv, \quad y_1 = u^2 - v^2, \quad m = u^2 + v^2,
\]

where $u$ and $v$ are relatively prime positive integers of opposite parity with
$u > v$.


Now $n = 2r = 2t^2$, and so the first of these equations becomes $uv = t^2$. It
follows that $u$ and $v$ are each squares, say $u = x_2^2$ and $v = y_2^2$. Feeding
these facts into the equation $m = u^2 + v^2$ gives

\[
x_2^4 + y_2^4 = m = s^2 \quad (= z_2^2, \text{ say}),
\]

revealing another solution of the original equation. But this solution is
‘smaller’ since

\[
0 < z_2 = s \leq m < m^2 + n^2 = z_1.
\]

This completes the descent step; any positive solution gives rise to a smaller,
positive one. As this is impossible, the assumption that there exists a
positive solution is contradicted. ■


Infinite descent bears some resemblance to mathematical induction. The
descent step, like the induction step, sets up an unending chain of
implications of the form ‘if one is true then so too is the next’. The
difference is that this time we know that no such infinite chain of true
statements can exist and we are forced to conclude that the chain cannot be
initiated; it has no basis.


As any fourth power is necessarily a square, any positive solution of
$x^4 + y^4 = z^4$ would contradict the result of Theorem 3.1. Hence


Corollary to Theorem 3.1


The Diophantine equation $x^4 + y^4 = z^4$ has no positive solutions.
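
A finite search proves nothing, but it does illustrate what Theorem 3.1 asserts. The Python sketch below, in which the function name `solutions_x4_y4_z2` and the bound are assumptions of this example, looks for positive solutions of $x^4 + y^4 = z^2$ with $x \leq y$ up to a chosen bound.

```python
from math import isqrt


def solutions_x4_y4_z2(bound):
    """All (x, y, z) with 1 <= x <= y <= bound and x^4 + y^4 = z^2."""
    found = []
    for x in range(1, bound + 1):
        for y in range(x, bound + 1):
            total = x ** 4 + y ** 4
            z = isqrt(total)           # exact integer square root
            if z * z == total:
                found.append((x, y, z))
    return found


print(solutions_x4_y4_z2(200))
```

By Theorem 3.1 the list returned is empty whatever bound is chosen.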


The method of infinite descent used in the proof of Theorem 3.1 can be 
applied to many Diophantine equations, almost invariably with the purpose 
of showing that the equation has no positive solutions. Here is a simpler 
example. 


Example 3.1

Show that the Diophantine equation $x^3 + 3y^3 = 9z^3$ has no positive
solutions.


Aiming for a contradiction suppose that $x_1^3 + 3y_1^3 = 9z_1^3$, where $x_1$, $y_1$ and $z_1$
are positive integers. As the prime 3 divides the right-hand side and one of
the terms on the left of this equation, it must divide the remaining term.
That is, 3 divides $x_1^3$ and so 3 divides $x_1$. Substituting $x_1 = 3x_2$ in the
equation gives

\[
27x_2^3 + 3y_1^3 = 9z_1^3
\]

and, dividing throughout by 3,

\[
9x_2^3 + y_1^3 = 3z_1^3.
\]

Repetition of the same reasoning now shows that 3 divides $y_1$ and putting
$y_1 = 3y_2$ the equation becomes

\[
9x_2^3 + 27y_2^3 = 3z_1^3; \quad\text{that is,}\quad 3x_2^3 + 9y_2^3 = z_1^3.
\]

This time we have 3 dividing $z_1$, and putting $z_1 = 3z_2$ the equation now
becomes

\[
3x_2^3 + 9y_2^3 = 27z_2^3; \quad\text{that is,}\quad x_2^3 + 3y_2^3 = 9z_2^3.
\]

At this point we have reached a second solution, $x = x_2$, $y = y_2$, $z = z_2$ of
the original equation with $z_2 < z_1$. The descent step is therefore complete
and the required contradiction is established. Thus $x^3 + 3y^3 = 9z^3$ has no
positive solutions.


Problem 3.1 


Show that the Diophantine equation $x^4 + 2y^4 = 4z^4$ has no positive
solutions. 


The following problem requires a slight variant on Fermat’s method of 
infinite descent.

Problem 3.2

(a) Show that if $x^2 + y^2 + z^2 \equiv 0 \pmod 4$ then each of $x$, $y$ and $z$ is even.

(b) Show that in any positive solution $x = x_1$, $y = y_1$, $z = z_1$ of the
Diophantine equation $x^2 + y^2 + z^2 = 2xyz$, each of $x_1$, $y_1$ and $z_1$ must
be even. Putting $x_1 = 2x_2$, $y_1 = 2y_2$ and $z_1 = 2z_2$, show that $x_2$, $y_2$ and
$z_2$ are also even positive integers.

(c) Prove that the Diophantine equation $x^2 + y^2 + z^2 = 2xyz$ has no
positive solution by constructing from such a solution a strictly
decreasing infinite sequence of even positive integers.


3.3 Related Diophantine equations 


Fermat went on to apply his method of infinite descent to prove that the 
related equation $x^4 - y^4 = z^2$ has no positive solution. However we shall
first establish Theorem 3.2 and then deduce Fermat’s result as a corollary. 


Theorem 3.2 


The Diophantine equation $x^4 + 4y^4 = z^2$ has no positive solution.


Proof of Theorem 3.2 


Suppose that $x_1$, $y_1$ and $z_1$ are positive integers with $x_1^4 + 4y_1^4 = z_1^2$. Exactly
as in the proof of Theorem 3.1 we may assume that $\gcd(x_1, y_1) = 1$, for
otherwise cancellation of any common divisor leads to a smaller solution of
the same equation.


With infinite descent in mind, we wish to show that the existence of this one
positive solution necessarily gives rise to one with a smaller $z$-value. We
observe that $x_1$ and $z_1$ are of the same parity, and we first consider the case
in which they are both even. Substituting $x_1 = 2x'$ and $z_1 = 2z'$ gives

\[
(2x')^4 + 4y_1^4 = (2z')^2; \quad\text{that is,}\quad y_1^4 + 4x'^4 = z'^2,
\]

which is another positive solution of the equation with $z' < z_1$. (The
solution is $x = y_1$, $y = x'$, $z = z'$.)

It remains to show that a solution in which $x_1$ and $z_1$ are both odd must
likewise lead to another with smaller $z$-value. Writing the equation as
$(x_1^2)^2 + (2y_1^2)^2 = z_1^2$ we recognize that $(2y_1^2, x_1^2, z_1)$ is a primitive Pythagorean
triple and so, according to Theorem 2.1, there are relatively prime positive
integers $m$ and $n$ of opposite parity and with $m > n$ such that

\[
2y_1^2 = 2mn, \quad x_1^2 = m^2 - n^2, \quad z_1 = m^2 + n^2.
\]

From $x_1^2 = m^2 - n^2$ we see that $m$ is odd and $n$ even, for the other way
round would give $m^2 - n^2 \equiv 3 \pmod 4$ and this cannot be a square. And
from $y_1^2 = mn$, the relatively prime integers $m$ and $n$ are each squares and so
we can write $m = a^2$ and $n = (2b)^2$.


Looking again at $x_1^2 = m^2 - n^2$ with $\gcd(m, n) = 1$, we now see that
$(n, x_1, m)$ is another primitive Pythagorean triple (the triple is primitive
because $\gcd(m, n) = 1$), and so there exist
relatively prime positive integers $s$ and $t$, of opposite parity and with $s > t$
such that

\[
n = 2st, \quad x_1 = s^2 - t^2, \quad m = s^2 + t^2.
\]

Substituting $n = (2b)^2$ into the first of these equations gives $2b^2 = st$. Now
one of $s$ or $t$ is even. If it is $s$, say $s = 2r$, then $b^2 = rt$ and as $r$ and $t$ are
relatively prime each is a square; say $r = u^2$ (so that $s = 2u^2$) and $t = v^2$.
The alternative that $t$ is even leads, in exactly the same way, to $s = v^2$ and
$t = 2u^2$. In either case, substituting into $m = s^2 + t^2$ gives

\[
v^4 + 4u^4 = s^2 + t^2 = m = a^2.
\]

Thus we have another solution of the original equation. Moreover, putting
$z_2 = a$ we have

\[
0 < z_2 = a \leq m < m^2 + n^2 = z_1,
\]

and so the $z$-value in this new solution is a smaller positive integer.


This completes the descent step and the proof. ■


Corollary to Theorem 3.2 


The Diophantine equation $x^4 - y^4 = z^2$ has no positive solution.


Proof of the Corollary 
Suppose that $x_1$, $y_1$ and $z_1$ are positive integers with $x_1^4 - y_1^4 = z_1^2$. Squaring
both sides of this equation and rearranging gives

\[
z_1^4 + 4(x_1y_1)^4 = (x_1^4 + y_1^4)^2.
\]

Thus $x = z_1$, $y = x_1y_1$, $z = x_1^4 + y_1^4$ is a positive solution of $x^4 + 4y^4 = z^2$.
But this contradicts Theorem 3.2 and so the equation $x^4 - y^4 = z^2$ has no
positive solution. ■
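
The rearrangement used in this proof is brief, so the intermediate steps are set out here for reference. This is simply one way of filling in the algebra, using the same symbols $x_1$, $y_1$, $z_1$ as the proof.

```latex
\begin{align*}
x_1^4 - y_1^4 &= z_1^2 \\
(x_1^4 - y_1^4)^2 &= z_1^4 \\
x_1^8 - 2x_1^4y_1^4 + y_1^8 &= z_1^4 \\
z_1^4 + 4x_1^4y_1^4 &= x_1^8 + 2x_1^4y_1^4 + y_1^8 \\
z_1^4 + 4(x_1y_1)^4 &= (x_1^4 + y_1^4)^2
\end{align*}
```

The fourth line is obtained by adding $4x_1^4y_1^4$ to both sides of the third.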


We finish with a problem for you to attempt. The solution to this problem 
first appeared in the margin of Fermat’s copy of Diophantus. 


Problem 3.3

Let $(x, y, z)$ be a Pythagorean triple. The area of the associated right-angled
triangle is $\frac{1}{2}xy$ and for this to be a square $xy = 2n^2$ for some integer $n$.


Show that if there is a positive simultaneous solution to this equation and to
$x^2 + y^2 = z^2$, then there is a positive solution of $a^4 - b^4 = c^2$, and hence
deduce that no Pythagorean triangle can have an area which is a square.


4 SUMS OF SQUARES 


4.1 Representing primes as sums of two squares 


Another problem which attracted Fermat’s attention concerned ways of 
expressing positive integers as sums of squares. For example, the integers 
from 1 to 9 can be written as sums of squares as follows. 


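\[
\begin{aligned}
1 &= 1^2 \\
2 &= 1^2 + 1^2 \\
3 &= 1^2 + 1^2 + 1^2 \\
4 &= 2^2 \\
5 &= 2^2 + 1^2 \\
6 &= 2^2 + 1^2 + 1^2 \\
7 &= 2^2 + 1^2 + 1^2 + 1^2 \\
8 &= 2^2 + 2^2 \\
9 &= 3^2
\end{aligned}
\]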
The expressions given are certainly not unique; for example we could write

\[
4 = 1^2 + 1^2 + 1^2 + 1^2 \quad\text{or}\quad 9 = 2^2 + 2^2 + 1^2.
\]

But notice that the given expression for 7 as a sum of squares is the best we
can do in the sense that 7 cannot be expressed as a sum of fewer than four 
squares. If you try continuing the list that we have started, with the aim of 
expressing each integer as a sum of as few squares as possible, you should 
find that each requires no more than four squares. 


Amongst other questions that these observations might suggest are the 
following. 


• Which integers can be expressed as a sum of two squares?


• Which integers can be expressed as a sum of three squares?


• Which integers can be expressed as a sum of two squares in a unique
way?


• Can all positive integers be expressed as a sum of four squares?


(We accept 0 as a square so that, for example, $4 = 2^2 + 0^2$ is a legitimate
way of expressing 4 as a sum of two squares. However we ignore
negative integers since $(-a)^2 = a^2$.)


Fermat appears to have solved the first two of these problems. In letters to 
Mersenne he claimed to have a proof, using his descent method, of the key 
step that every prime of the form 4k + 1 is expressible as a sum of two 
squares in a unique way. But yet again Fermat did not leave a copy of his 
proof and the mathematical world had to wait until 1747, when Euler 
provided one. 


The first serious contribution to the third question belongs to Diophantus 
who conjectured that no number of the form 8k + 7 can be expressed as a 
sum of three squares. Fermat appears to have been first to write down exact 
criteria for a number to be the sum of three squares, namely that the 
number must not be of the form $4^n(8k + 7)$ for non-negative integers $k$
and $n$. Proof of this was provided by Legendre in 1798.


Having discovered which integers can, and which cannot, be expressed as a 
sum of two squares, and which can, and which cannot, be expressed as a 
sum of three squares, is it worth proceeding to solve the analogous problem 
for four, five, six, ... squares? Well yes it is, because at the next stage the 
sequence of investigations reaches a conclusion when we uncover the classic 
result that every positive integer is a sum of four squares. It is believed that, 
from the way he posed his questions in this area, the result was probably 
suspected by Diophantus, but it was first expressed formally by Bachet in 
1621. Shortly after this, Fermat tackled the problem and (surprise, surprise) 
he claimed that he had a proof which used his descent method. Euler made 
various attempts at it over a period of more than 40 years but without 
success, which shows just how difficult it is. Eventually the four-square 
conjecture was proved in 1770 by Lagrange who acknowledged that ideas 
originating from Euler played a substantial part in his proof. 


First we shall investigate the two-square problem. The following identity, 
which is easily established, is going to be crucial in what follows. 


Important identity for two squares 


\[
(a^2 + b^2)(c^2 + d^2) = (ac + bd)^2 + (ad - bc)^2
\]


This identity is of theoretical importance because it tells us that if two 
positive integers m and n can each be written as a sum of two squares then 
so too can their product mn. This means that to show that a given integer 
can be written as a sum of two squares it is sufficient to show that each 
prime in its decomposition can be expressed this way. 


But the identity also has a practical use, as illustrated by the following 
example.


Example 4.1

Express 4420 as a sum of two squares.

$4420 = 2^2 \times 5 \times 13 \times 17 = (2^2 + 0^2)(2^2 + 1^2)(3^2 + 2^2)(4^2 + 1^2)$

Using the identity three times:

$4420 = (4^2 + 2^2)(3^2 + 2^2)(4^2 + 1^2)$
$= (16^2 + 2^2)(4^2 + 1^2)$
$= 66^2 + 8^2.$

This answer is not unique. It turns out that there are four ways of writing
4420 as a sum of two squares. The other ways can all be obtained from the
identity applied to different terms. We could change the order of the
bracketed terms, or switch the two terms within a bracket, or make use of
negative values for a, b, c or d. Each of these variations is illustrated below
where we discover the other three solutions.

$4420 = (4^2 + 2^2)(4^2 + 1^2)(2^2 + 3^2)$
$= (4^2 + 2^2)(11^2 + 10^2) = 64^2 + 18^2$

$4420 = (4^2 + 2^2)(11^2 + (-10)^2) = 24^2 + (-62)^2 = 62^2 + 24^2$

$4420 = ((4^2 + 1^2)(2^2 + 1^2))((2^2 + 0^2)(2^2 + 3^2))$
$= (9^2 + 2^2)(4^2 + 6^2) = 48^2 + 46^2$
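The variations described in Example 4.1 can be explored mechanically. The following sketch (the function name `all_compositions` is hypothetical) applies the identity to a list of pairs, trying both signs of one term at each stage, and collects the distinct unordered results. It is an illustration of the idea under these assumptions, not a general algorithm taken from the unit.

```python
def all_compositions(pairs):
    """Combine pairs (a, b), each standing for a^2 + b^2, via the identity.

    At each stage both sign choices for b are tried, which also covers
    swapping the two terms inside a bracket. Results are stored with the
    larger entry first so that x^2 + y^2 and y^2 + x^2 count once.
    """
    reps = {tuple(pairs[0])}
    for c, d in pairs[1:]:
        new = set()
        for a, b in reps:
            for s in (1, -1):
                x = abs(a * c + s * b * d)
                y = abs(a * d - s * b * c)
                new.add((max(x, y), min(x, y)))
        reps = new
    return sorted(reps, reverse=True)


# 4420 = (2^2 + 0^2)(2^2 + 1^2)(3^2 + 2^2)(4^2 + 1^2)
print(all_compositions([(2, 0), (2, 1), (3, 2), (4, 1)]))
```

For the factorization of 4420 used above, this produces the same four representations found in the example.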


Problem 4.1 


Find three ways of expressing 325 as a sum of two squares. 


Turning to the question of whether a positive integer n can be expressed as 
a sum of two squares, our important identity guides us to look at the primes 
occurring in its decomposition. Now primes can be classified into three 
types: 

• the even prime $2 = 1^2 + 1^2$, which is a sum of two squares;

• odd primes of the form 4k + 1;

• odd primes of the form 4k + 3.


The third category presents little problem. As a square is congruent modulo 
4 to either 0 or 1, a sum of two squares is congruent modulo 4 to one of 0, 1 
or 2. So no prime of the form 4k + 3 can be expressed as a sum of two 
squares. 


That leaves just the middle category. We shall prove that all primes of the 
form 4k + 1 can be written as a sum of two squares. It will then be a simple 
matter to complete the two-square problem. The result was first stated and 
proved by Fermat, and our proof follows his, using the descent method, but 
in a different way from our previous applications. 


Theorem 4.1 Primes expressed as a sum of two squares 


A prime p can be expressed as a sum of two squares if, and only if,
$p = 2$ or $p \equiv 1 \pmod{4}$.


All that remains to be done is to show that any prime $p \equiv 1 \pmod{4}$ can be
expressed as a sum of two squares. Before we present the proof formally let
us explain how we are going to use the descent method. We start by
assuming that some multiple of p can be expressed as a sum of two squares.
More precisely, we assume that

$mp = x^2 + y^2$

has a solution for some $1 \le m < p$. If m = 1 we have the required expression
for p as a sum of two squares. If m > 1, we go on to deduce from the above
equation that a smaller multiple of p can be expressed as a sum of two
squares, that is,

$m_1 p = u^2 + v^2,$

where $1 \le m_1 < m$. This is our descent step. The difference this time is that
we shall go on to show that the equation $mp = x^2 + y^2$ really does have a
solution with $1 \le m < p$. The descent step tells us that from this solution
there is then a smaller solution, and from this a smaller one, and so on. The
essence of the descent method is that such a process cannot go on forever,
and so must terminate. The only way it can terminate is by descending to a
solution with m = 1 (so that the descent step cannot be applied again).
Hence the existence of a solution to $mp = x^2 + y^2$ with $1 \le m < p$ leads to
the conclusion that $p = x^2 + y^2$ must have a solution.


Now for the details. 


Proof of Theorem 4.1 


Suppose that the equation $mp = x^2 + y^2$ has a solution in which $1 < m < p$.
Let u and v be the least absolute residues modulo m of x and y respectively.
That is,

$u \equiv x \pmod{m}, \quad v \equiv y \pmod{m}, \quad -\frac{m}{2} < u, v \le \frac{m}{2}.$

Then

$u^2 + v^2 \equiv x^2 + y^2 \equiv 0 \pmod{m},$

and so

$u^2 + v^2 = mr$, for some integer $r \ge 0$.


To establish the descent step we aim to show that rp is a sum of two squares
with $1 \le r < m$. First we check that r does lie in this range.


If r = 0 then u = v = 0 implying that m divides both x and y. But then,
from $mp = x^2 + y^2$, we conclude that $m^2$ divides mp, so that m divides p,
which is plainly impossible since $1 < m < p$. So

$1 \le r = \frac{u^2 + v^2}{m} \le \frac{1}{m}\left(\frac{m^2}{4} + \frac{m^2}{4}\right) = \frac{m}{2} < m.$


Now we must show that rp is a sum of two squares.


Multiplying together the equations $mp = x^2 + y^2$ and $mr = u^2 + v^2$ gives

$m^2 rp = (x^2 + y^2)(u^2 + v^2) = (xu + yv)^2 + (xv - yu)^2.$

Now

$xu + yv \equiv x^2 + y^2 \equiv 0 \pmod{m},$

implying that m divides $xu + yv$, and

$xv - yu \equiv xy - xy \equiv 0 \pmod{m},$

implying that m divides $xv - yu$. Putting $xu + yv = mX$ and
$xv - yu = mY$ leads to

$m^2 rp = m^2 X^2 + m^2 Y^2$; that is, $rp = X^2 + Y^2$.

This completes the descent step.

We can, of course, replace X by $-X$ or Y by $-Y$ if necessary to
replace negative integers.


It remains to show that $mp = x^2 + y^2$ has a solution for some m with
$1 \le m < p$. Property (e) of Theorem 2.1 of Unit 6 provides this solution
since it tells us that $-1$ is a quadratic residue of each prime $p \equiv 1 \pmod{4}$.
Consequently the congruence $x^2 + 1 \equiv 0 \pmod{p}$ has a least positive
solution $x_1$ with $0 < x_1 \le p - 1$. So there exists a positive integer m such
that

$mp = x_1^2 + 1^2,$

which is exactly as required since

$m = \frac{x_1^2 + 1}{p} \le \frac{(p - 1)^2 + 1}{p} = \frac{p^2 - 2p + 2}{p} < p.$


Now if this solution has m > 1 then the established descent step guarantees
a solution with smaller, positive value of m. We descend through such
smaller solutions until we reach one with m = 1; that is, a solution of
$p = x^2 + y^2$. ∎


The descent step in the above proof may have read like a theoretical 
argument but, in fact, it contains a construction for getting from an 
expression of one multiple of p as a sum of two squares to a similar 
expression for a smaller multiple of p. We can see how the descent step 
works by looking at a particular example. 


Example 4.2 


Given that $60^2 + 1^2 = 13 \times 277$, retrace the proof of the descent step in
Theorem 4.1 to express the prime 277 as a sum of two squares.


From

$13 \times 277 = 60^2 + 1^2$

we have p = 277, m = 13, x = 60 and y = 1. So $u \equiv 60 \pmod{13}$ and
$v \equiv 1 \pmod{13}$ and as u and v are least absolute residues modulo 13 we
have $u = -5$ and $v = 1$. Therefore

$13r = (-5)^2 + 1^2,$

so that r = 2. Multiplying these two equations together:

$2 \times 13^2 \times 277 = (60^2 + 1^2)((-5)^2 + 1^2)$
$= (-299)^2 + 65^2,$

and on dividing both sides by $13^2$ and removing the minus sign from inside
the square,

$2 \times 277 = 23^2 + 5^2.$

This completes the first descent step.


As we have not reached 277 itself we descend again. Now m = 2, x = 23 and
y = 5 and, replacing x and y by their least absolute residues modulo 2,
u = 1 and v = 1, so that

$u^2 + v^2 = 1^2 + 1^2 = 2r$

gives r = 1. Multiplying the two equations together

$2^2 \times 277 = (23^2 + 5^2)(1^2 + 1^2) = 28^2 + 18^2,$

and on dividing both sides by $2^2$,

$277 = 14^2 + 9^2.$

We have thus expressed 277 as a sum of two squares.

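The construction in the descent step can be sketched in code. The function names below (`least_abs_residue`, `descend`) are assumptions of this sketch. It takes a starting solution of $mp = x^2 + y^2$ with $1 \le m < p$ and repeats the step from the proof of Theorem 4.1 until m = 1.

```python
def least_abs_residue(a, m):
    """Residue r of a modulo m with -m/2 < r <= m/2."""
    r = a % m
    return r - m if 2 * r > m else r


def descend(p, x, y):
    """Given m*p = x^2 + y^2 with 1 <= m < p, return (x, y) with p = x^2 + y^2."""
    m = (x * x + y * y) // p
    while m > 1:
        u = least_abs_residue(x, m)
        v = least_abs_residue(y, m)
        # Both numerators are divisible by m, as shown in the proof.
        big_x = (x * u + y * v) // m
        big_y = (x * v - y * u) // m
        x, y = abs(big_x), abs(big_y)
        m = (x * x + y * y) // p  # the new, smaller multiplier r
    return x, y


print(descend(277, 60, 1))
```

Starting from $60^2 + 1^2 = 13 \times 277$, the loop passes through $23^2 + 5^2 = 2 \times 277$ before stopping, just as in Example 4.2.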
Knowing that a prime p = 4k + 1 can be expressed as $x^2 + y^2$,
mathematicians understandably took up the challenge of finding ways of 
constructing the integers x and y in terms of the prime p. Several such 
constructions have emerged, but none of them is particularly easy to employ. 


The first, due to Legendre, showed how to obtain x and y from the
continued fraction of $\sqrt{p}$. One disadvantage of this method is that we first
have to obtain the continued fraction of $\sqrt{p}$ which can be no mean task
when p is large. The next, due to Gauss, is easy to state.


Let prime p = 4k + 1, and let x and y be least absolute residues
modulo p satisfying $x \equiv \frac{(2k)!}{2(k!)^2} \pmod{p}$ and $y \equiv (2k)!\,x \pmod{p}$.


Then $x^2 + y^2 = p$.


This is very neat, but try using it to express 277 (k = 69) as a sum of two 
squares! We shall not be proving it either. 
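Although impractical by hand, Gauss's construction is easy to try by machine. The sketch below assumes Python 3.8 or later (for the modular inverse via `pow(d, -1, p)`); the function name `gauss_two_squares` is hypothetical, and signs are discarded at the end since only the squares matter.

```python
from math import factorial


def gauss_two_squares(p):
    """For a prime p = 4k + 1, return (|x|, |y|) from Gauss's construction."""
    k = (p - 1) // 4
    a = factorial(2 * k) % p                # (2k)! modulo p
    d = (2 * factorial(k) ** 2) % p         # 2 (k!)^2 modulo p
    x = a * pow(d, -1, p) % p               # x = (2k)! / (2 (k!)^2) mod p
    y = a * x % p                           # y = (2k)! x mod p

    def least_abs(t):
        # Move t from the range 0..p-1 to the least absolute residue.
        return t - p if 2 * t > p else t

    return abs(least_abs(x)), abs(least_abs(y))


x, y = gauss_two_squares(277)
print(x, y, x * x + y * y)
```

The function does not check that p is a prime of the form 4k + 1; that is left to the caller.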


For practical purposes the method of exhaustion offers as good a method as
any for expressing moderately sized primes as a sum of two squares. For
instance, look again at the prime 277. First note that if $277 = x^2 + y^2$ then
one of $x^2$ or $y^2$ is less than $\frac{277}{2}$, while the other is greater than $\frac{277}{2}$. By
virtue of Theorem 4.1 we need only check

$277 - 1^2, \quad 277 - 2^2, \quad 277 - 3^2, \quad \ldots$

until we find a square, in the certain knowledge that one will turn up by the
time we reach $277 - 12^2$, since $12^2 > \frac{277}{2}$. In fact $277 - 9^2 = 14^2$.

This method finds just one solution
but, as we shall see in a moment,
there is only the one solution.


To represent a composite number as a sum of two squares we can combine
the suggested exhaustive approach with use of the important identity.
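The exhaustive search is short to write down. In this sketch the name `prime_as_two_squares` is an assumption; the loop tests $p - 1^2, p - 2^2, \ldots$ and stops once $x^2$ would exceed $p/2$, which is the cut-off used in the text.

```python
from math import isqrt


def prime_as_two_squares(p):
    """Search p - x^2 for a perfect square, for x = 1, 2, ... with x^2 < p/2.

    Returns (y, x) with p = x^2 + y^2 and y > x, or None if no square turns up.
    """
    x = 1
    while 2 * x * x < p:
        rest = p - x * x
        y = isqrt(rest)
        if y * y == rest:
            return y, x
        x += 1
    return None


print(prime_as_two_squares(277))
print(prime_as_two_squares(7))
```

For a prime of the form 4k + 3 the search runs to the cut-off and reports `None`, in line with Theorem 4.1.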


Problem 4.2 


Express 5321 = 17 x 313 as a sum of two squares. 


4.2 Sums of two squares, completed 


We have seen several examples of integers which can be expressed as a sum 
of squares in more than one way. This is not true of primes. No prime of the 
form 4k + 3 can be expressed as a sum of two squares whilst each prime of 
the form 4k + 1 can be expressed as a sum of two squares in a unique way. 
The uniqueness assumes, as we have been doing all along, that by squares
we mean squares of non-negative integers, and also that the sum of squares
$x^2 + y^2$ is regarded as the same expression as $y^2 + x^2$.


Before we start the proof, note that if $p = x^2 + y^2$, where x and y are
positive integers, then each of x and y lies strictly between 0 and $\sqrt{p}$; neither
can be 0 for otherwise the prime p would be a square. Moreover
gcd(x, y) = 1, as any common divisor of x and y must divide $x^2 + y^2$; that
is, it must divide p and it certainly cannot be p itself.


Theorem 4.2 Uniqueness of representation 


The expression of a prime of form 4k + 1 as a sum of two squares is 
unique except for the order of the two summands.


Proof of Theorem 4.2


Suppose that $p = a^2 + b^2 = c^2 + d^2$, where a > b > 0 and c > d > 0. We
must show that a = c and b = d so that the expression for p as a sum of two
squares is unique.


From the two expressions for p we have

$a^2 d^2 - b^2 c^2 = (p - b^2)d^2 - b^2(p - d^2) = p(d^2 - b^2) \equiv 0 \pmod{p}.$

That is,

$(ad - bc)(ad + bc) \equiv 0 \pmod{p}.$


Now Euclid's Lemma tells us that either p divides $ad - bc$ or p divides
$ad + bc$. We shall show that the former must be the case. To that end
suppose to the contrary that p divides $ad + bc$. As each of a, b, c and d lies
strictly between 0 and $\sqrt{p}$ we have $0 < ad + bc < 2p$. It must therefore be
the case that $ad + bc = p$. But then

$p^2 = (a^2 + b^2)(d^2 + c^2) = (ad + bc)^2 + (ac - bd)^2$
$= p^2 + (ac - bd)^2$

so that $ac - bd = 0$. But since a > b and c > d we have ac > bd and so we
have reached the required contradiction.


It follows that p divides $ad - bc$. Again, since each of the four integers lies
strictly between 0 and $\sqrt{p}$, we have $-p < ad - bc < p$, and consequently
$ad = bc$. From this, a divides bc and, since gcd(a, b) = 1, a divides c. Putting
c = ka the equation ad = bc becomes d = kb, and then

$p = c^2 + d^2 = k^2(a^2 + b^2) = k^2 p.$


This implies that k = 1 and, in turn, leads to a = c and b = d, exactly as
required. ∎


We now know exactly which primes can be expressed as a sum of two 
squares, namely 2 and any prime of the form 4k + 1, and we know that any 
product of numbers expressible as a sum of two squares is itself so 
expressible. Hence any number which has no prime divisor of the form 

4k + 3 is certainly expressible as a sum of two squares. But are these the
only ones? A little experimentation reveals that they are not. For example,
$18 = 3^2 + 3^2$ is a sum of two squares which does not adhere to the above
prescription because it has a prime divisor of the form 4k + 3, namely 3
itself. The essential point with 18 is that it is immaterial that 3 cannot be
written as a sum of two squares because $3^2$ divides 18 and

$18 = 2 \times 3^2 = (1^2 + 1^2)3^2 = 3^2 + 3^2.$

In general the identity

$c^2(a^2 + b^2) = (ca)^2 + (cb)^2$

shows that we can multiply any sum of squares by any square and retain a
sum of squares. We can build on this observation in the following way.
Suppose, for example, that we want to find one way of expressing
$2^7 \times 3^4 \times 5 \times 13^3$ as a sum of squares. By first isolating as large a square
term as possible we can write

$2^7 \times 3^4 \times 5 \times 13^3 = (2^3 \times 3^2 \times 13)^2 \times (2 \times 5 \times 13)$

and the task will be completed when we express the square-free part,
$2 \times 5 \times 13$, as a sum of two squares.

$(2^3 \times 3^2 \times 13)^2 \times 2 \times 5 \times 13$
$= (2^3 \times 3^2 \times 13)^2 \times (1^2 + 1^2)(1^2 + 2^2)(2^2 + 3^2)$
$= (2^3 \times 3^2 \times 13)^2 \times (1^2 + 1^2)(8^2 + 1^2)$
$= (2^3 \times 3^2 \times 13)^2 \times (9^2 + 7^2)$
$= (2^3 \times 3^4 \times 13)^2 + (2^3 \times 3^2 \times 7 \times 13)^2$


Remember that a square-free
integer is one which is not divisible 
by the square of any prime. Every 
positive integer is square, 
square-free or can be expressed as 
a square multiplied by a 
square-free integer. 


We shall make use of this idea in our proof of the following result which 
gives a complete classification of which integers are sums of two squares. 


Theorem 4.3 Sums of two squares 


A positive integer n can be expressed as a sum of two squares if, and 
only if, each of its prime divisors of the form 4k + 3, if any, occurs to 
an even power. 


Proof of Theorem 4.3 


By pulling out the largest square divisor of n we can write $n = m^2 r$, where r
is square-free. The theorem asserts that n is expressible as a sum of two
squares if, and only if, r is not divisible by any prime of the form 4k + 3.

We allow the possibility r = 1.


To prove the ‘if, and only if,’ assertion we have to establish the implications
both ways, so we break the proof into two parts.


(a) Suppose that r has no prime divisor of the form 4k + 3. If r = 1 then
$n = m^2 + 0^2$ and there is nothing to prove. If r > 1 then r is a product
of one or more primes each of which is either 2 or of the form 4k + 1.
We have seen that such a product r can be expressed as a sum of two
squares, say $r = a^2 + b^2$, so $n = m^2(a^2 + b^2) = (ma)^2 + (mb)^2$.


(b) Suppose that n can be expressed as a sum of two squares, say
$n = m^2 r = a^2 + b^2$.


First, any common divisor of a and b may be cancelled as follows. If
gcd(a, b) = d then we can write $a = a_1 d$, $b = b_1 d$, where $\gcd(a_1, b_1) = 1$,
and

$m^2 r = d^2(a_1^2 + b_1^2).$

As r is square-free d divides m and so, writing $m_1 = \frac{m}{d}$,

$m_1^2 r = a_1^2 + b_1^2.$

Any prime which divides d occurs
with exponent 2, or more, on the
right-hand side of the equation
$m^2 r = d^2(a_1^2 + b_1^2)$
and so must divide m.


Our task is to show that r does not have a prime divisor of the form
4k + 3. Aiming for a contradiction suppose that the prime p = 4k + 3
divides r. Then

$a_1^2 + b_1^2 \equiv 0 \pmod{p}$; that is, $a_1^2 \equiv -b_1^2 \pmod{p}$.


Now if p divides $a_1$ we would have p divides $b_1$ thereby contradicting
$\gcd(a_1, b_1) = 1$. So $\gcd(a_1, p) = \gcd(b_1, p) = 1$ and FLT can be applied
giving

$a_1^{p-1} \equiv b_1^{p-1} \equiv 1 \pmod{p}.$

Putting p = 4k + 3,

$1 \equiv a_1^{p-1} = a_1^{4k+2} = (a_1^2)^{2k+1} \equiv (-b_1^2)^{2k+1}$
$= (-1)^{2k+1}(b_1^2)^{2k+1} = (-1)\,b_1^{p-1} \equiv -1 \pmod{p}.$


This cannot possibly be true for an odd prime p and so we have a
contradiction, and the result follows. ∎
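Theorem 4.3 can be checked numerically against a direct search. The sketch below is a sanity check under stated assumptions, not part of the proof: it factorizes by simple trial division, and both function names are hypothetical.

```python
from math import isqrt


def is_sum_of_two_squares(n):
    """Apply the criterion of Theorem 4.3 using trial division."""
    p = 2
    while p * p <= n:
        exponent = 0
        while n % p == 0:
            n //= p
            exponent += 1
        if p % 4 == 3 and exponent % 2 == 1:
            return False
        p += 1
    # What is left is 1 or a single prime occurring to the first power.
    return n % 4 != 3


def found_by_search(n):
    """Directly look for x with n - x^2 a perfect square."""
    return any(isqrt(n - x * x) ** 2 == n - x * x for x in range(isqrt(n) + 1))


print([n for n in range(1, 31) if is_sum_of_two_squares(n)])
```

Comparing the two functions over a range of n gives a quick numerical confirmation of the theorem for small cases.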


Problem 4.3 


Which integers in the range 1995 to 2005 inclusive can be expressed as a 
sum of two squares? For any which can be so expressed find one such 
representation.


Problem 4.4


Use Theorem 4.3 to show that no positive integer which is congruent 
modulo 9 to either 3 or 6 can be expressed as a sum of two squares. For each 
of the other seven congruence classes modulo 9 find the smallest positive 
integer which can be represented as a sum of two squares and the smallest 
which cannot. 


4.3 Sums of three and four squares 


For the sake of completeness we include a little more about the three-square
and four-square problems. Finding which integers can, and which cannot, be
expressed as a sum of three squares is not difficult but, as so often happens
in number theory, finding a proof of an assertion for which there is
overwhelming numerical evidence is a different matter. We can readily
discover, and prove, which numbers cannot be expressed as a sum of three
squares; the difficulty arises in proving that all other numbers can be so
expressed.


Theorem 4.4 Sums of three squares 


A positive integer can be expressed as a sum of three squares if, and 
only if, it is not of the form $4^n(8m + 7)$ for some $n \ge 0$, $m \ge 0$.


Proof of Theorem 4.4 


We shall give here only the easier half of the ‘if, and only if,’ proof, namely 
that no number of the stated form can be expressed as a sum of three 
squares. 


The squares modulo 8 are 0, 1 and 4, and consequently a sum of three 
squares can be congruent modulo 8 to any of the values 0, 1, 2, 3, 4, 5 or 6, 
but not to 7. So no number of the form 8m + 7 can be a sum of three 
squares. 


Now suppose that for some $n \ge 1$ and $m \ge 0$ we have

$4^n(8m + 7) = x^2 + y^2 + z^2.$

As the left-hand side is congruent modulo 4 to 0, and as squares modulo 4
are either 0 or 1, it has to be the case that x, y and z are all even. Putting
$x = 2x_1$, $y = 2y_1$ and $z = 2z_1$ we get

$4^{n-1}(8m + 7) = x_1^2 + y_1^2 + z_1^2.$


If $n - 1 \ge 1$ then $x_1$, $y_1$ and $z_1$ are still even and the argument can be
repeated:

$4^{n-2}(8m + 7) = x_2^2 + y_2^2 + z_2^2.$


In this way we descend through powers of 4 until 8m + 7 itself is expressed
as a sum of three squares. But that is impossible, so the assumption that
$4^n(8m + 7)$ can be expressed as a sum of three squares must be false. ∎
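The condition in Theorem 4.4 is simple to test, and small cases can be compared with a direct search. The function names below are assumptions of this sketch; the search simply tries $x \le y$ and checks whether the remainder is a square.

```python
from math import isqrt


def has_excluded_form(n):
    """True if n = 4^a (8b + 7) for some a, b >= 0."""
    while n % 4 == 0:
        n //= 4
    return n % 8 == 7


def three_squares(n):
    """Return (x, y, z) with x^2 + y^2 + z^2 = n, or None if there is none."""
    for x in range(isqrt(n) + 1):
        for y in range(x, isqrt(n - x * x) + 1):
            rest = n - x * x - y * y
            z = isqrt(rest)
            if z * z == rest:
                return x, y, z
    return None


print(three_squares(14), three_squares(15), has_excluded_form(15))
```

The search confirms only the small cases it is run on; the full 'if' direction of the theorem is the hard part that the unit does not prove.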


Problem 4.5 


Which, if any, of the following numbers can be expressed as a sum of three 
squares? For any which can, find such a representation. 


(a) 39 (b) 56 (c) 448 (d) 10!


As with sums of two squares we
allow $0^2$ so that, for example,
$1^2 + 1^2 + 0^2$ is a representation of 2
as a sum of three squares.


This is similar to the method you 
met in Problem 3.2 


One main reason for the difficulty in proving the second part of Theorem 4.4, 
namely that any integer not of the given form can be expressed as a sum of 
three squares, stems from the fact that there is no equivalent of our 
important identity. It is not true that if two numbers can each be expressed 
as a sum of three squares then so too can their product. For example, 


$3 = 1^2 + 1^2 + 1^2$ and $5 = 2^2 + 1^2 + 0^2$
but the product of these numbers, 15, is not a sum of three squares. 
Ironically, when we progress to the four-square problem the equivalent of our 


important identity re-emerges. There is an identity expressing the product 
of two sums of four squares as a sum of four squares. Here it is. 


Important Identity for four squares 


$(a^2 + b^2 + c^2 + d^2)(w^2 + x^2 + y^2 + z^2)$


$= (aw + bx + cy + dz)^2 + (ax - bw + cz - dy)^2$
$\quad + (ay - bz - cw + dx)^2 + (az + by - cx - dw)^2$


With the benefit of this identity the four-square problem essentially comes 
down to the determination of which primes can be written as a sum of four 
squares. 
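The four-square identity can be applied in exactly the same way as the two-square one. In this sketch the name `compose4` is hypothetical; it returns the four bracketed terms of the identity for two given quadruples.

```python
def compose4(p, q):
    """Combine two sums of four squares using the four-square identity."""
    a, b, c, d = p
    w, x, y, z = q
    return (a * w + b * x + c * y + d * z,
            a * x - b * w + c * z - d * y,
            a * y - b * z - c * w + d * x,
            a * z + b * y - c * x - d * w)


# 3 = 1^2 + 1^2 + 1^2 + 0^2 and 5 = 2^2 + 1^2 + 0^2 + 0^2
terms = compose4((1, 1, 1, 0), (2, 1, 0, 0))
print(terms, sum(t * t for t in terms))
```

This gives $15 = 3^2 + 1^2 + 2^2 + 1^2$, so the product that fails for three squares succeeds once a fourth square is allowed.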


Theorem 4.5 Lagrange’s Four Square Theorem 


Every positive integer can be expressed as a sum of four squares. 


As indicated above, to prove Lagrange’s Theorem it is sufficient to show 
that each prime can be expressed as a sum of four squares. The even prime 
$2 = 1^2 + 1^2 + 0^2 + 0^2$ can certainly be expressed this way. For the odd
primes there is a very similar argument to our proof of Theorem 4.1 which 
we could use. Suppose that some multiple mp of the odd prime p can be 
expressed as a sum of four squares, say 


$mp = a^2 + b^2 + c^2 + d^2, \quad 1 \le m < p.$

If m = 1 we have the required expression. If not, careful algebra allows us to
‘descend’ to a smaller multiple which is also a sum of four squares,

$m_1 p = a_1^2 + b_1^2 + c_1^2 + d_1^2,$

where $1 \le m_1 < m$.
The final stage is to show that there really is an appropriate multiple of p 
which is a sum of four squares, so that from this multiple we can descend in 


a finite number of steps to p itself being a sum of four squares. The details 
of the descent step are quite intricate and we shall omit the proof here. 


Having established that four squares suffice to represent any non-negative 
integer, a natural follow-up question is to ask how many cubes, fourth 
powers, fifth powers, ... are needed. In 1770 Waring proposed the following 
conjecture. 


Theorem 4.6 Waring’s Problem 


For each integer $k \ge 2$ there exists a positive integer g(k) such that


every positive integer can be expressed as a sum of at most g(k) kth 
powers. 


The assertion is that for each k a number g(k) exists, but it carries with it
the associated problem of finding values for g(k). The amount of research
that has gone into the investigation of various values of g(k) has made
Waring’s Problem one of the most actively pursued areas of Number Theory.
In 1909 Hilbert proved Waring’s original conjecture. For each $k \ge 2$ the
number g(k) does exist. But Hilbert’s proof was highly theoretical and
offered little insight into the values of g(k).


Lagrange’s Four Square Theorem confirms that g(2) = 4. This was known to 
Waring, who went on to claim that g(3) = 9 and g(4) = 19, both of which 
have now been shown to be true, the latter only very recently. The former is 
claiming that every positive integer can be expressed as a sum of at most 
nine cubes. In fact there are just two such integers which require nine cubes: 


$23 = 2^3 + 2^3 + 1^3 + 1^3 + 1^3 + 1^3 + 1^3 + 1^3 + 1^3$
and
$239 = 4^3 + 4^3 + 3^3 + 3^3 + 3^3 + 3^3 + 1^3 + 1^3 + 1^3.$
All integers exceeding 239 can be expressed as a sum of at most eight cubes, 


and it has been shown that only finitely many of them do require eight 
cubes, so that from some point onwards seven cubes will suffice. 


Back in 1772, J.A. Euler, son of Leonhard, discovered a lower bound for g(k):


$g(k) \ge \mathrm{int}\left(\left(\tfrac{3}{2}\right)^k\right) + 2^k - 2.$


We shall not pause to do so here, but it is not difficult to prove that this
inequality holds. What is surprising is that this easily obtained lower bound
turns out to give the true value of g(k) for all k so far verified, and very
likely for all k. It is now known that $g(k) = \mathrm{int}\left(\left(\tfrac{3}{2}\right)^k\right) + 2^k - 2$ for all
$2 \le k \le 200000$, and that there are at most a finite number of exceptions
after this point.
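The expression is easy to evaluate exactly with integer arithmetic, and for small numbers the least count of kth powers can be found by a simple table-filling search. Both function names below are assumptions of this sketch, and `int` in the formula is taken to mean the integer part.

```python
def euler_bound(k):
    """int((3/2)^k) + 2^k - 2, using exact integer division for the floor."""
    return 3 ** k // 2 ** k + 2 ** k - 2


def min_kth_powers(n, k):
    """Least number of positive kth powers summing to n (table-filling search)."""
    best = [0] * (n + 1)
    for t in range(1, n + 1):
        candidates = []
        b = 1
        while b ** k <= t:
            candidates.append(best[t - b ** k])
            b += 1
        best[t] = 1 + min(candidates)
    return best[n]


print([euler_bound(k) for k in (2, 3, 4)])
print(min_kth_powers(23, 3), min_kth_powers(239, 3))
```

The search is only practical for small n, but it is enough to confirm that 23 and 239 each need nine cubes.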


Diversion 


The English number theorist G.H. Hardy tells the following story concerning his 
young protégé Ramanujan: I remember going to visit him in Putney hospital. I 
had travelled there in taxi cab Number 1729 and I remarked that the number 
seemed a rather dull one to me; I hoped it was not an unfavourable omen. He 
replied: ‘On the contrary it is a very interesting number; it is the smallest number 
which can be expressed as the sum of two cubes in two different ways’! 


ADDITIONAL EXERCISES 


Section 1 


1 Find two positive solutions of each of the Diophantine equations
(a) $x^2 - 14y^2 = 1$ and (b) $x^2 - 18y^2 = 1$.


2 Find two positive integer solutions of each of the equations

$x^2 - 17y^2 = \pm 1$.


3 Show that if $x = x_1$, $y = y_1$ is a solution of $x^2 - ny^2 = -1$ then
$x = 2x_1^2 + 1$, $y = 2x_1 y_1$ satisfies $x^2 - ny^2 = 1$. Use this fact to find a
solution of $x^2 - 74y^2 = 1$, given that $\sqrt{74} = [8, (1, 1, 1, 1, 16)]$.


You might like to check that what
the identity gives for k = 2, 3 and 4
agrees with what we have already
found.


(a) Show that the equation $x^2 - ny^2 = -1$ has no solution if
$n \equiv 3 \pmod{4}$.

(b) Show that if $x^2 - ny^2 = m$ has solutions, where m and n are
relatively prime, then m is a quadratic residue of each odd prime
divisor of n.


Confirm that the equation $x^2 - 34y^2 = -1$ gives a counter-example
to the converse of this result.


Find three positive solutions of $x^2 + 2xy - 2y^2 = 1$.


The number 48 has the curious property that if 1 is added to it the 
result is a square (49), whilst if 1 is added to half of it the result is also 
a square (25). Find two more positive integers with this property. 


Section 2 


1 


If (x, y, z) is a primitive Pythagorean triple prove that each of $x + y$
and $x - y$ is congruent modulo 8 to either 1 or 7.


Find all Pythagorean triples for which the associated right-angled 
triangle has its area numerically equal to its perimeter. 


Show that if (x,y,z) is a Pythagorean triple then at least one of x or y 
is divisible by 3, at least one of x or y is divisible by 4 and at least one 
of x, y or z is divisible by 5. 


Section 3 


1 


Suppose that $\sqrt{5} = \frac{m}{n}$, where m and n are positive integers. Show that
$\sqrt{5} = \frac{5n - 2m}{m - 2n}$. Deduce from this, using the method of infinite descent,
that $\sqrt{5}$ is irrational.


Show that it is impossible to find four positive integers which have the
property that the sum of their squares is divisible by twice their
product. Use the following method.


(a) Consider the Diophantine equation

$w^2 + x^2 + y^2 + z^2 = 8kwxyz.$

Show that, in any solution, each of w, x, y and z must be even.
Hint: Consider the two sides of the equation modulo 8.
Then use infinite descent to prove that this equation has no
positive solutions.

(b) Deduce that the Diophantine equation

$w^2 + x^2 + y^2 + z^2 = 2kwxyz$

has no positive solutions.


The Diophantine equation $x^4 - y^4 = 2z^2$ can be shown to have no
solution in which x and y are both odd. Assuming this fact, use the 


method of infinite descent to show that this equation has no positive 
solution. 


Show that the Diophantine equation z? + y? = xy? has no positive 
solutions. 


Note that $\sqrt{34} = [5, (1, 4, 1, 10)]$.


Section 4


Express each of the numbers 245, 260 and 245 x 260 as a sum of two 
squares. 


If m and n can each be expressed as a sum of two squares and m
divides n, is it necessarily true that $\frac{n}{m}$ can be expressed as a sum of
two squares?


Either prove or give a counter-example, as appropriate. 


Show that if p = 4k + 1 is prime then 2p can be written as a sum of two 
squares in a way which is unique apart from the order of the summands. 


Find the three smallest integers greater than 1000 which cannot be 
expressed as a sum of three squares. 


Show that the number n can be expressed as a sum of three triangular 
numbers if, and only if, 8n + 3 can be expressed as a sum of three 
squares. Hence deduce that every positive integer is a sum of three 
triangular numbers. 


Challenge Problems


Prove that if $\frac{a}{b}$ is a convergent of $\sqrt{n}$ then x = a, y = b is a solution of
exactly one of the equations $x^2 - ny^2 = k$, where $|k| < 1 + 2\sqrt{n}$.


Let $x = x_k$, $y = y_k$ be the successive solutions of the equation
$x^2 - ny^2 = 1$, with $x = x_1$, $y = y_1$ being the fundamental solution.
Show that

$x_{k+1} = 2x_1 x_k - x_{k-1}$ and $y_{k+1} = 2x_1 y_k - y_{k-1}$,

for all integers $k \ge 2$.


Show that 169 can be expressed as a sum of one, of two, of three and of 
four non-zero squares. Hence show that every integer greater than 169 
is a sum of five non-zero squares. 


Which positive integers cannot be expressed as a sum of five non-zero 


squares? 


Prove that the Diophantine equation z? + y? = w? + z? does not have a 
positive solution in the case where y is odd and has no prime divisor of 
the form 4k + 3 and z is congruent modulo 4 to 2. 


Hence show that z? = y? +7 has no solution. 


Solutions to the Problems 


SOLUTIONS TO THE PROBLEMS 


Solution 1.1 
(a) The convergents of $\sqrt{3} = [1, (1, 2)]$ begin as shown in the following
table.

k                   1     2     3     4     5       6
C_k = p_k/q_k       1/1   2/1   5/3   7/4   19/11   26/15
p_k^2 - 3q_k^2      -2    1     -2    1     -2      1


It appears that every even convergent gives a solution of this Pell’s
equation.


(b) For $\sqrt{10} = [3, (6)]$ we have:

k                   1     2      3        4         5           6
C_k = p_k/q_k       3/1   19/6   117/37   721/228   4443/1405   27379/8658
p_k^2 - 10q_k^2     -1    1      -1       1         -1          1


Once again the even convergents appear to give solutions of Pell’s
equation, the first three of which give:

$19^2 - 10 \times 6^2 = 1$;

$721^2 - 10 \times 228^2 = 1$;

$27379^2 - 10 \times 8658^2 = 1$.


Solution 1.2 
The convergents of $\sqrt{11} = [3, (3, 6)]$ are

$\frac{3}{1}, \frac{10}{3}, \frac{63}{19}, \frac{199}{60}, \frac{1257}{379}, \frac{3970}{1197}, \ldots$

As $\sqrt{11}$ has a cycle of even length 2, the even convergents will all give rise to
solutions of this Pell’s equation. The first three solutions are therefore given
by the convergents $C_2$, $C_4$ and $C_6$ as:

x = 10, y = 3, $10^2 - 11 \times 3^2 = 1$;

x = 199, y = 60, $199^2 - 11 \times 60^2 = 1$;

x = 3970, y = 1197, $3970^2 - 11 \times 1197^2 = 1$.


Solution 1.3
As $\sqrt{13} = [3, (1, 1, 1, 1, 6)]$ has cycle of odd length 5, the convergent $\frac{p_5}{q_5}$
satisfies $p_5^2 - 13q_5^2 = -1$ and the convergent $\frac{p_{10}}{q_{10}}$ gives the smallest solution
of $x^2 - 13y^2 = 1$.
The convergents of $\sqrt{13}$ begin

$\frac{3}{1}, \frac{4}{1}, \frac{7}{2}, \frac{11}{3}, \frac{18}{5}, \frac{119}{33}, \frac{137}{38}, \frac{256}{71}, \frac{393}{109}, \frac{649}{180}$

and we can confirm that:

$18^2 - 13 \times 5^2 = -1$,
giving x = 18, y = 5 as a solution of $x^2 - 13y^2 = -1$;

$649^2 - 13 \times 180^2 = 1$,
giving x = 649, y = 180 as a solution of $x^2 - 13y^2 = +1$.


Solution 1.4 
The first convergent, $C_1 = \frac{4}{1}$, gives the smallest solution $x_1 = 4$, $y_1 = 1$ of
$x^2 - 17y^2 = -1$.
As
$(4 + \sqrt{17})^2 = 33 + 8\sqrt{17}$,
we have x = 33, y = 8 as a solution of $x^2 - 17y^2 = 1$. As
$(4 + \sqrt{17})^4 = (33 + 8\sqrt{17})^2 = 2177 + 528\sqrt{17}$,
we have x = 2177, y = 528 as the next solution of $x^2 - 17y^2 = 1$.
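The powers of $4 + \sqrt{17}$ can be generated without any surds by tracking the two integer coefficients. The function name `powers_of_solution` is an assumption of this sketch.

```python
def powers_of_solution(x1, y1, n, count):
    """List (x, y, x^2 - n y^2) for the first `count` powers of x1 + y1*sqrt(n)."""
    x, y = x1, y1
    results = []
    for _ in range(count):
        results.append((x, y, x * x - n * y * y))
        # Multiply (x + y*sqrt(n)) by (x1 + y1*sqrt(n)).
        x, y = x * x1 + n * y * y1, x * y1 + y * x1
    return results


for row in powers_of_solution(4, 1, 17, 4):
    print(row)
```

The third entry of each row alternates between $-1$ and $+1$, so the even powers give the solutions of $x^2 - 17y^2 = 1$ quoted above.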


Solution 1.5 


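As $\sqrt{2} = [1, (2)]$, which has cycle of length 1, every even convergent gives a
solution of the Pell’s equation. The first ten convergents of $\sqrt{2}$ are as follows.

$\frac{1}{1}, \frac{3}{2}, \frac{7}{5}, \frac{17}{12}, \frac{41}{29}, \frac{99}{70}, \frac{239}{169}, \frac{577}{408}, \frac{1393}{985}, \frac{3363}{2378}$

For each convergent $\frac{p_{2k}}{q_{2k}}$, the required triangular square is $\left(\frac{q_{2k}}{2}\right)^2$. The first
five are:

$\left(\frac{2}{2}\right)^2 = 1, \quad \left(\frac{12}{2}\right)^2 = 36, \quad \left(\frac{70}{2}\right)^2 = 1225, \quad \left(\frac{408}{2}\right)^2 = 41616, \quad \left(\frac{2378}{2}\right)^2 = 1413721.$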


Solution 2.1 


(a) (10, 8, 6) is not a Pythagorean triple; the hypotenuse must be the
largest of the three values and, by convention, must be the last member 
of the triple. 


(b) (12,9, 15) is a Pythagorean triple, but is not primitive since it is the 
(4, 3,5) triple scaled by a multiple of 3. 


(c) (6, 7, 8) is not a Pythagorean triple: $6^2 + 7^2 \ne 8^2$.


(d) (5,12, 13) is a primitive Pythagorean triple: 5? + 12? = 169 = 13? and 
the three integers are relatively prime in pairs. 


(e) (24,33, 41) is not a Pythagorean triple since 24 and 33 have a common 
divisor, namely 3, which does not divide the third number. 
Solution 2.2 


We seek relatively prime positive integers m and n, of opposite parity and
with m > n, such that 2mn = 72. There are two such pairs of integers:

m = 36, n = 1, giving the triple (72, 1295, 1297);
m = 9, n = 4, giving the triple (72, 65, 97).

The pairs m = 18, n = 2 and
m = 12, n = 3 are excluded
because they are not relatively
prime.
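The search over pairs m and n can be sketched as follows. The function name is hypothetical, and the triple is built as $(2mn, m^2 - n^2, m^2 + n^2)$ with the even member first, matching the triples listed above.

```python
from math import gcd


def primitive_triples_with_even_side(e):
    """All primitive triples (e, m^2 - n^2, m^2 + n^2) with 2mn = e."""
    if e % 2:
        return []
    half = e // 2
    triples = []
    for n in range(1, half + 1):
        if half % n:
            continue
        m = half // n
        # Require m > n, gcd(m, n) = 1 and opposite parity.
        if m > n and gcd(m, n) == 1 and (m - n) % 2 == 1:
            triples.append((e, m * m - n * n, m * m + n * n))
    return triples


print(primitive_triples_with_even_side(72))
```

Calling it with 30 returns an empty list, since the even member of a primitive triple must be divisible by 4.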


Solution 2.3


One possible strategy is to look at each divisor of 30 in turn and to
determine all primitive triples containing that divisor. But notice that, since
30 is not a multiple of 4, no even divisor of 30 can occur in a primitive
Pythagorean triple. So we need look only at the odd divisors of 30. Any
primitive triple containing one of 3, 5 or 15 may be scaled appropriately to
give a Pythagorean triple containing 30.

Remember that the even member
of a primitive triple is divisible
by 4.

We therefore seek all pairs of relatively prime integers m and n of opposite
parity and with m > n such that either $m^2 + n^2$ or $m^2 - n^2$ is equal to one
of 3, 5 or 15. It turns out that there are just five such pairs, as presented
below.


Divisor  m  n  Primitive triple  Scaling  Triple with 30
15       4  1  (8, 15, 17)       2        (16, 30, 34)
15       8  7  (112, 15, 113)    2        (224, 30, 226)
5        3  2  (12, 5, 13)       6        (72, 30, 78)
5        2  1  (4, 3, 5)         6        (24, 18, 30)
3        2  1  (4, 3, 5)         10       (40, 30, 50)

As $m^2 - n^2 = (m - n)(m + n)$ we
need only consider m and n for …

Solution 2.4 


Suppose that $st = u^2$, where s and t are relatively prime. Let s and t have
prime decompositions $s = p_1^{k_1} p_2^{k_2} \cdots p_r^{k_r}$, $t = p_{r+1}^{k_{r+1}} \cdots p_j^{k_j}$. As
gcd(s, t) = 1 the listed primes are all distinct and so st has prime
decomposition

$st = p_1^{k_1} p_2^{k_2} \cdots p_r^{k_r} p_{r+1}^{k_{r+1}} \cdots p_j^{k_j}.$

The primes are not necessarily
listed in ascending order here.

As st is a square, each exponent in this expression is even. Hence each
exponent in s, and each exponent in t, is even and so these two are squares
as well.


Solution 2.5 


Suppose that (x, x + d, x + 2d) is a Pythagorean triple. Then

$x^2 + (x + d)^2 = (x + 2d)^2$

giving

$x^2 - 2xd - 3d^2 = 0.$

This factorizes as

$(x - 3d)(x + d) = 0.$

To fit our convention the triple
would actually be (x + d, x, x + 2d).


Hence x = 3d or x = —d. As the latter cannot lead to positive solutions we 
are left with x = 3d, which gives the triple (3d, 4d, 5d). The only 
Pythagorean triples which have the three numbers in arithmetic progression 
are the multiples of (3, 4,5). 


Solution 2.6 
Suppose that (m, m + 1, n) is a Pythagorean triple, so m^2 + (m + 1)^2 = n^2.
Then

T_{2m}^2 + T_{2m+1}^2 = (2m(2m + 1)/2)^2 + ((2m + 1)(2m + 2)/2)^2
                      = (2m + 1)^2 (m^2 + (m + 1)^2) = ((2m + 1)n)^2

which shows that (T_{2m}, T_{2m+1}, 2mn + n) is a Pythagorean triple. Knowing
that there are infinitely many primitive Pythagorean triples (m, m + 1, n) we
conclude that there are infinitely many whose shorter sides are consecutive
triangular numbers.
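The construction can be tried numerically. This is an illustrative sketch only; the helper names `T` and `consecutive_leg_triples` are hypothetical, and the search simply tests each m up to a bound.

```python
from math import isqrt

def T(r):
    """The r-th triangular number r(r + 1)/2."""
    return r * (r + 1) // 2

def consecutive_leg_triples(limit):
    """Triples (m, m + 1, n) with m^2 + (m + 1)^2 = n^2 and m <= limit."""
    out = []
    for m in range(1, limit + 1):
        s = m * m + (m + 1) ** 2
        n = isqrt(s)
        if n * n == s:
            out.append((m, m + 1, n))
    return out

for m, _, n in consecutive_leg_triples(1000):
    a, b, c = T(2 * m), T(2 * m + 1), (2 * m + 1) * n
    # The identity in the solution says a^2 + b^2 = c^2.
    print((m, m + 1, n), "->", (a, b, c), a * a + b * b == c * c)
```

For example, (3, 4, 5) leads to (T_6, T_7, 35) = (21, 28, 35).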


Solution 3.1 
Suppose that x_1^4 + 2y_1^4 = 4z_1^4, where x_1, y_1 and z_1 are positive
integers. As the prime 2 divides two of the terms it must divide the third,
namely x_1^4. So 2 divides x_1 and we can write x_1 = 2x_2 for some positive
integer x_2. Substituting for x_1 gives

16x_2^4 + 2y_1^4 = 4z_1^4; that is, 8x_2^4 + y_1^4 = 2z_1^4.

Repeating the argument, we can write y_1 = 2y_2 for some positive integer y_2
and obtain

8x_2^4 + 16y_2^4 = 2z_1^4; that is, 4x_2^4 + 8y_2^4 = z_1^4.

We can now write z_1 = 2z_2 for some positive integer z_2 and

4x_2^4 + 8y_2^4 = 16z_2^4; that is, x_2^4 + 2y_2^4 = 4z_2^4.


At this point we have reached a second positive solution, x = x_2, y = y_2,
z = z_2 of the original equation with z_2 < z_1. The descent step is therefore
complete and the required contradiction established.


Thus x^4 + 2y^4 = 4z^4 has no positive solutions.


Solution 3.2 


(a) As any square is congruent modulo 4 to 0 (if it is even) or to 1 (if it is
odd) it follows that x^2 + y^2 + z^2 ≡ 0 (mod 4) can only occur when x, y
and z are all even.

(b) Suppose that x_1^2 + y_1^2 + z_1^2 = 2x_1y_1z_1. If x_1, y_1 and z_1 are
all odd then x_1^2 + y_1^2 + z_1^2 ≡ 3 (mod 4) while the right-hand side,
2x_1y_1z_1, is even. Hence at least one of x_1, y_1 and z_1 must be even.
But then x_1^2 + y_1^2 + z_1^2 = 2x_1y_1z_1 ≡ 0 (mod 4) and, as we have just
seen, x_1, y_1 and z_1 must all be even.

Writing x_1 = 2x_2, y_1 = 2y_2 and z_1 = 2z_2 we have

(2x_2)^2 + (2y_2)^2 + (2z_2)^2 = 2(2x_2)(2y_2)(2z_2),

which simplifies to

x_2^2 + y_2^2 + z_2^2 = 4x_2y_2z_2.

As the right-hand side is congruent modulo 4 to 0, the first part of the
question once again gives that x_2, y_2 and z_2 are even positive integers.

(c) Continuing from part (b), if we write x_2 = 2x_3, y_2 = 2y_3, z_2 = 2z_3 we
obtain

x_3^2 + y_3^2 + z_3^2 = 8x_3y_3z_3

with x_3, y_3 and z_3 being even positive integers. We can continue forever
in this way, halving the x, y and z values and yet retaining even positive
integers; for from

x_n^2 + y_n^2 + z_n^2 = 2^n x_n y_n z_n

we note that x_n, y_n and z_n are even integers and

(x_n/2)^2 + (y_n/2)^2 + (z_n/2)^2 = 2^{n+1} (x_n/2)(y_n/2)(z_n/2).

The same argument now confirms that x_n/2, y_n/2 and z_n/2 are still even
positive integers. As the sequence z_1, z_2, z_3, ... of positive integers
cannot decrease indefinitely, the method of infinite descent shows that
the supposed positive solution of the original equation cannot exist.


Solution 3.3 
From x^2 + y^2 = z^2 and xy = 2n^2 we have
(x + y)^2 = x^2 + y^2 + 2xy = z^2 + (2n)^2
and
(x − y)^2 = x^2 + y^2 − 2xy = z^2 − (2n)^2.
Multiplying these two equations together gives
(x^2 − y^2)^2 = z^4 − (2n)^4.

Margin note: x ≠ y in any Pythagorean triple and x^2 − y^2 can be replaced
by y^2 − x^2 if the former is negative.

This contradicts the Corollary to Theorem 3.2. We conclude that no
Pythagorean triangle can have an area which is a square.


Solution 4.1 
325 = 5^2 × 13 = (2^2 + 1^2)(2^2 + 1^2)(3^2 + 2^2)
    = (2^2 + 1^2)(8^2 + 1^2) = 17^2 + 6^2
    = (1^2 + 2^2)(8^2 + 1^2) = 10^2 + 15^2
    = (2^2 + 1^2)(2^2 + 3^2)(2^2 + 1^2) = (7^2 + 4^2)(2^2 + 1^2) = 18^2 + 1^2
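The three expressions can be cross-checked by brute force. The sketch below assumes the two-squares identity in the form (a^2 + b^2)(c^2 + d^2) = (ac + bd)^2 + (ad − bc)^2, which may differ in sign conventions from the statement used in the unit; the names `two_square_reps` and `combine` are hypothetical.

```python
from math import isqrt

def two_square_reps(n):
    """All pairs (a, b) with a >= b >= 0 and a^2 + b^2 == n."""
    reps = []
    for b in range(isqrt(n // 2) + 1):
        a = isqrt(n - b * b)
        if a * a + b * b == n:
            reps.append((a, b))
    return reps

def combine(a, b, c, d):
    """(a^2 + b^2)(c^2 + d^2) = (ac + bd)^2 + (ad - bc)^2, returned as a pair."""
    return (abs(a * c + b * d), abs(a * d - b * c))

print(two_square_reps(325))   # [(18, 1), (17, 6), (15, 10)]
print(combine(2, 1, 8, 1))    # (17, 6)
print(combine(1, 2, 8, 1))    # (10, 15)
print(combine(7, 4, 2, 1))    # (18, 1)
```

Swapping the order of the two squares in one factor, as in the second and third calls, is what produces a different expression for the product.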


Solution 4.2 


As 313 ≡ 1 (mod 4) we know that 313 can be expressed as a sum of two
squares. Searching the sequence

313 − 1^2, 313 − 2^2, 313 − 3^2, ...

for the first square we find 313 − 12^2 = 13^2. Therefore

5321 = 17 × 313 = (1^2 + 4^2)(12^2 + 13^2) = 64^2 + 35^2.

In fact there is just one alternative solution, which can be reached by
writing 313 as 13^2 + 12^2:

5321 = (1^2 + 4^2)(13^2 + 12^2) = 61^2 + 40^2.

Margin note: The expression for a prime is unique, but it may not be so for a
composite integer.
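The search for the first square in the sequence p − 1^2, p − 2^2, ... is easy to automate. This is a small sketch, with the hypothetical helper names `prime_as_two_squares` and `product_as_two_squares`; the product step assumes the identity (a^2 + b^2)(c^2 + d^2) = (ac + bd)^2 + (ad − bc)^2.

```python
from math import isqrt

def prime_as_two_squares(p):
    """Search p - 1^2, p - 2^2, ... for the first perfect square.
    Returns (k, r) with k^2 + r^2 == p, or None if there is none."""
    k = 1
    while k * k < p:
        rest = p - k * k
        r = isqrt(rest)
        if r * r == rest:
            return (k, r)
        k += 1
    return None

def product_as_two_squares(a, b, c, d):
    """Express (a^2 + b^2)(c^2 + d^2) as a sum of two squares."""
    return (abs(a * c + b * d), abs(a * d - b * c))

a, b = prime_as_two_squares(17)     # (1, 4)
c, d = prime_as_two_squares(313)    # (12, 13)
print(product_as_two_squares(a, b, c, d))   # (64, 35)
print(product_as_two_squares(a, b, d, c))   # (61, 40)
```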


Solution 4.3 
1995 = 3 × 5 × 7 × 19      not expressible since 3 ≡ 3 (mod 4)
1996 = 2^2 × 499           not expressible since 499 ≡ 3 (mod 4)
1997 is a prime            1997 = 34^2 + 29^2
1998 = 2 × 3^3 × 37        not expressible since 3 occurs with odd exponent
1999 is a prime            not expressible since 1999 ≡ 3 (mod 4)
2000 = 2^4 × 5^3           2000 = 40^2 + 20^2
2001 = 3 × 23 × 29         not expressible since 3 ≡ 3 (mod 4)
2002 = 2 × 7 × 11 × 13     not expressible since 7 ≡ 3 (mod 4)
2003 is a prime            not expressible since 2003 ≡ 3 (mod 4)
2004 = 2^2 × 3 × 167       not expressible since 3 ≡ 3 (mod 4)
2005 = 5 × 401             2005 = 41^2 + 18^2
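The table can be reproduced with a short test of the criterion used here, namely that n is a sum of two squares exactly when every prime of the form 4k + 3 occurs in n with even exponent. The sketch below uses trial division, and the function names are hypothetical.

```python
def factorize(n):
    """Prime decomposition of n as a dict {prime: exponent}, by trial division."""
    factors = {}
    p = 2
    while p * p <= n:
        while n % p == 0:
            factors[p] = factors.get(p, 0) + 1
            n //= p
        p += 1
    if n > 1:
        factors[n] = factors.get(n, 0) + 1
    return factors

def is_sum_of_two_squares(n):
    """True when each prime p = 3 (mod 4) dividing n has even exponent."""
    return all(e % 2 == 0 for p, e in factorize(n).items() if p % 4 == 3)

print([y for y in range(1995, 2006) if is_sum_of_two_squares(y)])
# [1997, 2000, 2005]
```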


Solution 4.4 


If n ≡ 3 (mod 9) then n = 9k + 3 = 3(3k + 1) for some integer k. Now
gcd(3, 3k + 1) = 1 and so, in the prime decomposition of n, 3 occurs with
exponent 1. Theorem 4.3 confirms that n cannot be expressed as a sum of
two squares.


Similarly, if n = 9k + 6 = 3(3k + 2) then n is divisible by 3 but not by 3^2
and so is not expressible as a sum of two squares.
Taking each of the other congruence classes modulo 9 in turn:


Residue class modulo 9 | Smallest sum of two squares | Smallest not a sum of two squares


Solution 4.5 
(a) 39 ≡ 7 (mod 8) and so cannot be expressed as a sum of three squares.

(b) 56 = 4 × 14 is not of the form 4^n(8m + 7) and so can be written as a sum
of three squares:

56 = 6^2 + 4^2 + 2^2.

(c) 448 = 4^3 × 7 is of the form 4^n(8m + 7) and so cannot be expressed as a
sum of three squares.

(d) 10! = 2^8 × 3^4 × 5^2 × 7 = 4^4(3^4 × 5^2 × 7). Now as 3^2 ≡ 5^2 ≡ 1 (mod 8)
it follows that 3^4 × 5^2 × 7 ≡ 7 (mod 8) and so 10! cannot be expressed as
a sum of three squares.
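The test applied in each part, whether the integer has the form 4^n(8m + 7), can be written as a few lines of code. This is a sketch only; the name `excluded_form` is hypothetical.

```python
from math import factorial

def excluded_form(n):
    """True if the positive integer n can be written as 4^a (8m + 7)."""
    while n % 4 == 0:   # strip every factor of 4
        n //= 4
    return n % 8 == 7

for n in (39, 56, 448, factorial(10)):
    verdict = "not a sum of three squares" if excluded_form(n) else "a sum of three squares"
    print(n, "is", verdict)
```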


SOLUTIONS TO ADDITIONAL 
EXERCISES 


Section 1 


1 (a) √14 = [3, (1, 2, 1, 6)] and has convergents

3/1, 4/1, 11/3, 15/4, 101/27, 116/31, 333/89, 449/120, ...

The convergents 15/4 and 449/120, which come immediately before the end of
the cycle, give the required solutions.

15^2 − 14 × 4^2 = 1, so x = 15, y = 4
and
449^2 − 14 × 120^2 = 1, so x = 449, y = 120.

(b) √18 = [4, (4, 8)] and has convergents

4/1, 17/4, 140/33, 577/136, ...

The convergents 17/4 and 577/136 give the two required solutions.

17^2 − 18 × 4^2 = 1, so x = 17, y = 4
and
577^2 − 18 × 136^2 = 1, so x = 577, y = 136.
Margin note: Each of the primes 3, 7, 11 and 19 is of the form 4k + 3.


(c) √17 = [4, (8)] and has convergents

4/1, 33/8, 268/65, 2177/528, ...

As the cycle has length 1 the convergents alternately give solutions of
x^2 − 17y^2 = −1 and x^2 − 17y^2 = 1.

4^2 − 17 × 1^2 = −1,        33^2 − 17 × 8^2 = 1,
268^2 − 17 × 65^2 = −1,     2177^2 − 17 × 528^2 = 1.

So the solutions of x^2 − 17y^2 = −1 are x = 4, y = 1 and x = 268,
y = 65, and the solutions of x^2 − 17y^2 = +1 are x = 33, y = 8 and
x = 2177, y = 528.
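The convergent calculations in this exercise follow one pattern, which can be sketched in code. The version below uses the standard recurrence for the partial quotients of √d and the usual recurrence for convergents; the function names and the `max_terms` safety cap are assumptions made for illustration, and d is assumed not to be a perfect square.

```python
from itertools import islice
from math import isqrt

def sqrt_cf_terms(d):
    """Yield the partial quotients of sqrt(d), for d not a perfect square."""
    a0 = isqrt(d)
    m, q, a = 0, 1, a0
    while True:
        yield a
        m = q * a - m
        q = (d - m * m) // q
        a = (a0 + m) // q

def pell_solutions(d, rhs, count, max_terms=200):
    """First `count` convergents p/q of sqrt(d) with p^2 - d q^2 == rhs.
    Gives up after `max_terms` partial quotients (e.g. when no solution exists)."""
    sols = []
    h_prev, h = 0, 1   # numerators
    k_prev, k = 1, 0   # denominators
    for a in islice(sqrt_cf_terms(d), max_terms):
        h_prev, h = h, a * h + h_prev
        k_prev, k = k, a * k + k_prev
        if h * h - d * k * k == rhs:
            sols.append((h, k))
            if len(sols) == count:
                break
    return sols

print(pell_solutions(14, 1, 2))    # [(15, 4), (449, 120)]
print(pell_solutions(18, 1, 2))    # [(17, 4), (577, 136)]
print(pell_solutions(17, -1, 2))   # [(4, 1), (268, 65)]
print(pell_solutions(17, 1, 2))    # [(33, 8), (2177, 528)]
```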


Putting x = 2x_1^2 + 1 and y = 2x_1y_1, where x_1^2 − ny_1^2 = −1:

x^2 − ny^2 = (2x_1^2 + 1)^2 − n(2x_1y_1)^2 = 4x_1^4 + 4x_1^2 + 1 − 4nx_1^2y_1^2
           = 4x_1^2(x_1^2 − ny_1^2) + 4x_1^2 + 1
           = 4x_1^2(−1) + 4x_1^2 + 1 = 1.

The first solution of x^2 − 74y^2 = −1 will arise from the convergent
[8, 1, 1, 1, 1]. Working systematically through the convergents:

8/1, 9/1, 17/2, 26/3, 43/5.

The solution of x^2 − 74y^2 = 1, given by the first part of the question, is
then

x = 2 × 43^2 + 1 = 3699, y = 2 × 43 × 5 = 430.


In fact this is the smallest positive solution (given by the numerator
and denominator of the convergent [8, 1, 1, 1, 1, 16, 1, 1, 1, 1]).
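The passage from a solution of x^2 − ny^2 = −1 to one of x^2 − ny^2 = 1 is a one-line map. A small sketch, with the hypothetical function name `lift`:

```python
def lift(x1, y1):
    """Map a solution of x^2 - n y^2 = -1 to (2 x1^2 + 1, 2 x1 y1),
    which satisfies x^2 - n y^2 = 1."""
    return (2 * x1 * x1 + 1, 2 * x1 * y1)

n = 74
x1, y1 = 43, 5
print(x1**2 - n * y1**2)    # -1
x, y = lift(x1, y1)
print((x, y))               # (3699, 430)
print(x**2 - n * y**2)      # 1
```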


(a) A square is congruent modulo 4 to either 0 or 1. Hence, if
n ≡ 3 (mod 4),

x^2 − ny^2 ≡ {0 or 1} − {0 or 3} ≡ {0, 1 or 2} (mod 4).

In this case x^2 − ny^2 = −1 is impossible.

(b) Let p be an odd prime divisor of n. Then

x^2 ≡ m (mod p)

and so m is a quadratic residue of p.

For the equation x^2 − 34y^2 = −1, we note that 17 is the only odd
prime divisor of 34 and −1 is a quadratic residue of 17 (by Euler's
Criterion). But the equation is known to have no solutions because
√34 = [5, (1, 4, 1, 10)] has a cycle of even length.


A little algebra reveals the connection with Pell's equation:

x^2 + 2xy − 2y^2 = 1 can be written as (x + y)^2 − 3y^2 = 1.

Substituting z = x + y we obtain

z^2 − 3y^2 = 1

whose solutions are found from the convergents of √3 = [1, (1, 2)] which
are

1/1, 2/1, 5/3, 7/4, 19/11, 26/15, ...

The three convergents 2/1, 7/4 and 26/15 give the three required solutions:

z = 2, y = 1 gives x = 1, y = 1;
z = 7, y = 4 gives x = 3, y = 4;
z = 26, y = 15 gives x = 11, y = 15.


Margin note: Check: 43^2 − 74 × 5^2 = −1.

Margin note: Check: 3699^2 − 74 × 430^2 = 1.
Let the number be x. The properties it has to possess are

x + 1 = y^2 and x/2 + 1 = z^2.

Eliminating x from this pair of equations produces y^2 − 2z^2 = −1. As
the continued fraction √2 = [1, (2)] has a cycle of length 1, every
odd-numbered convergent yields a solution of this equation. The convergents
begin

1/1, 3/2, 7/5, 17/12, 41/29, 99/70, 239/169, ...

giving the required solutions:

y = 1, z = 1 and hence x = 0 (which we discount as it is not positive);
y = 7, z = 5 and hence x = 48 (the one given);
y = 41, z = 29 and hence x = 1680;
y = 239, z = 169 and hence x = 57120.
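A direct search gives an independent check on these values. The sketch assumes the two conditions are that x + 1 and x/2 + 1 are both perfect squares (so x must be even); the function names are hypothetical.

```python
from math import isqrt

def is_square(k):
    r = isqrt(k)
    return r * r == k

def special_numbers(limit):
    """Positive even x < limit with x + 1 and x/2 + 1 both perfect squares."""
    return [x for x in range(2, limit, 2)
            if is_square(x + 1) and is_square(x // 2 + 1)]

print(special_numbers(60000))   # [48, 1680, 57120]
```

The brute-force search is far slower than the continued fraction method as the numbers grow, which is the practical reason for preferring the convergents.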


Section 2 


1 
If (x, y, z) is a primitive Pythagorean triple with x as the even member
then x = 2mn and y = m^2 − n^2 for relatively prime integers m and n of
opposite parity. Then

x + y = 2mn + m^2 − n^2 = (m + n)^2 − 2n^2.

As m + n is odd, (m + n)^2 ≡ 1 (mod 8), and 2n^2 ≡ 0 or 2 (mod 8)
depending on whether n is even or odd respectively.
Hence x + y ≡ 1 or 7 (mod 8).


Similarly,

x − y = 2mn − m^2 + n^2 = −(m − n)^2 + 2n^2 ≡ 1 or 7 (mod 8).


For the triple (x, y, z) the area of the associated triangle is xy/2 and the
perimeter is x + y + z. So we have the two equations

x^2 + y^2 = z^2 and xy/2 = x + y + z

to solve simultaneously. Eliminating z gives

x^2 + y^2 = (xy/2 − x − y)^2 = x^2y^2/4 + x^2 + y^2 − x^2y − xy^2 + 2xy,

which, after cancelling x^2 + y^2, multiplying through by 4 and dividing
throughout by xy, reduces to

xy − 4x − 4y + 8 = 0

and hence to

(x − 4)(y − 4) = 8.


For positive x and y, the product on the left must be 8 × 1 or 4 × 2, so
there are just two solutions (ignoring those with x and y interchanged),
namely (12, 5, 13) and (8, 6, 10).
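The conclusion can be confirmed by an exhaustive search over small legs. A minimal sketch, with the hypothetical function name `area_equals_perimeter`; legs are listed with the shorter one first.

```python
from math import isqrt

def area_equals_perimeter(limit):
    """Pythagorean triples (x, y, z), x <= y <= limit, with xy/2 = x + y + z."""
    out = []
    for x in range(1, limit + 1):
        for y in range(x, limit + 1):
            z = isqrt(x * x + y * y)
            # Compare x*y with twice the perimeter to stay in integers.
            if z * z == x * x + y * y and x * y == 2 * (x + y + z):
                out.append((x, y, z))
    return out

print(area_equals_perimeter(100))   # [(5, 12, 13), (6, 8, 10)]
```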


If neither x nor y is divisible by 3 then x^2 + y^2 ≡ 2 (mod 3) which
cannot be a square.


Theorem 2.1 showed that the even member in any primitive
Pythagorean triple is divisible by 4; in any multiple of this triple it will
still be a multiple of 4.


If neither x nor y is divisible by 5 then
x^2 + y^2 ≡ {1 or 4} + {1 or 4} ≡ {0, 2 or 3} (mod 5).


As 2 and 3 are not quadratic residues of 5 we must have
x^2 + y^2 ≡ 0 (mod 5) which means that z is divisible by 5.


Hence at least one of x, y or z is divisible by 5.
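These three divisibility facts are easy to test on every Pythagorean triple with small legs. The following sketch (hypothetical helper name `pythagorean_triples`) checks them by enumeration; it is a sanity check, not a substitute for the congruence arguments.

```python
from math import isqrt

def pythagorean_triples(limit):
    """Yield all triples (x, y, z) with x <= y <= limit and x^2 + y^2 = z^2."""
    for x in range(1, limit + 1):
        for y in range(x, limit + 1):
            z = isqrt(x * x + y * y)
            if z * z == x * x + y * y:
                yield (x, y, z)

for x, y, z in pythagorean_triples(200):
    assert x % 3 == 0 or y % 3 == 0                 # a leg divisible by 3
    assert x % 4 == 0 or y % 4 == 0                 # a leg divisible by 4
    assert x % 5 == 0 or y % 5 == 0 or z % 5 == 0   # some member divisible by 5
print("all checks passed")
```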


Section 3 


If m = √5 n then

(5n − 2m)/(m − 2n) = (5 − 2√5)/(√5 − 2) = √5.

Note that m − 2n = (√5 − 2)n and since 0 < √5 − 2 < 1 we have
0 < m − 2n < n. Hence the assumption that √5 can be expressed as
the quotient m/n of positive integers leads to an expression for √5 as the
quotient of positive integers with a smaller denominator. This gives us
our descent step. As we cannot descend forever through positive
integers this is a contradiction. Hence √5 is irrational.


(a) As the right-hand side of the equation

w^2 + x^2 + y^2 + z^2 = 8kwxyz

is even, either none, two or all four of the variables w, x, y and
z must be odd. Recalling that the square of any odd integer is
congruent modulo 8 to 1, whilst the square of an even integer is
congruent modulo 8 to either 0 or 4,

w^2 + x^2 + y^2 + z^2 ≡ 4 (mod 8), when all four are odd
and
w^2 + x^2 + y^2 + z^2 ≡ {2 or 6} (mod 8), when two are odd.

As the right-hand side of the equation is congruent modulo 8 to 0,
the only possibility, therefore, is that all four of w, x, y and z are
even.

Now we can write w = 2w_1, x = 2x_1, y = 2y_1 and z = 2z_1 and the
equation becomes

w_1^2 + x_1^2 + y_1^2 + z_1^2 = 8(4k)w_1x_1y_1z_1 = 8k_1w_1x_1y_1z_1,

for k_1 = 4k. This is another positive solution of the original
equation (with the variables w, x, y, z and k) in which the four
values w, x, y and z have each been halved. We cannot descend for
ever through positive integer values in this way, and so the
assumption of a solution is contradicted.


(b) By very similar reasoning to that above, in any solution of

w^2 + x^2 + y^2 + z^2 = 2wxyz

w, x, y and z must all be even. Putting w = 2w_1, x = 2x_1, y = 2y_1
and z = 2z_1 the equation becomes

w_1^2 + x_1^2 + y_1^2 + z_1^2 = 8w_1x_1y_1z_1,

which we have seen has no solutions.


Margin note to part (a): The fifth variable, k, has increased in value, but
this does not affect the descent of the other four positive integers. In fact
we need only consider one variable, for example, z.

Margin note to part (b): If all four are odd the left-hand side is divisible
by 4 but the right is not. If two are odd and two even the right-hand side is
divisible by 4 but the left is not.
First note that, as x^4 − y^4 has to be even, in any solution x and y have
the same parity. As we are given that there is no solution with both x
and y odd we are left with the task of showing there is no solution in
which both are even. So, suppose to the contrary that there is a
positive solution in which x and y are both even. Putting x = 2x_1 and
y = 2y_1 the equation becomes 16(x_1^4 − y_1^4) = 2z^2. As the left-hand side
of this equation is divisible by 16 we must have that z is a multiple of 4,
and putting z = 4z_1 we get x_1^4 − y_1^4 = 2z_1^2. This completes the descent
step as 0 < z_1 < z. The assumption that there exists a positive solution
has led to the existence of a smaller positive one; an impossible
situation. Hence the equation has no positive solution.


Suppose that x^2 + y^2 = x^2y^2. Then, rearranging the equation,

(x^2 − 1)(y^2 − 1) = 1.


For the product of two integers to give 1, either each bracket on the left
is equal to 1 or each is equal to −1. Hence x = y = √2 or x = y = 0,
neither of which gives a positive integer solution.


Section 4 


1 
245 = 5 × 7^2 = (1^2 + 2^2) × 7^2 = 7^2 + 14^2
260 = 2^2 × 5 × 13 = 2^2(1^2 + 2^2)(2^2 + 3^2) = 2^2(8^2 + 1^2) = 16^2 + 2^2
245 × 260 = (7^2 + 14^2)(16^2 + 2^2) = 140^2 + 210^2


The result is true. If m and n are each sums of two squares then, for
each prime p of the form 4k + 3 which divides n, the exponent of p in n
is even and the exponent of p in m is even (possibly 0). Now the
exponent of p in n/m is the difference of these even numbers, and hence
even. The result follows from Theorem 4.3.
If p = 4k + 1 then p can be expressed as a sum of two squares, say
p = a^2 + b^2. But then

2p = (1^2 + 1^2)(a^2 + b^2) = (a + b)^2 + (a − b)^2

and so 2p can also be expressed as a sum of two squares.


For uniqueness, suppose that 2p = x^2 + y^2. Then x^2 + y^2 ≡ 2 (mod 4)
which means that each of x and y is odd. In that case (x + y)/2 and
(x − y)/2 are integers and

(x^2 + y^2)/2 = ((x + y)/2)^2 + ((x − y)/2)^2

which gives the expression

p = ((x + y)/2)^2 + ((x − y)/2)^2

for p as a sum of two squares. But we know by Theorem 4.2 that this
expression is unique. Therefore (x + y)/2 and (x − y)/2, and in consequence x
and y, are determined uniquely.


Margin note: The smaller solution must still have x_1 and y_1 even as the
alternative has been excluded.


We seek the three smallest integers exceeding 1000 which can be written
in the form 4^n(8m + 7). Any integer which is congruent to 7 modulo 8
is of this form and the only other candidates are multiples of 4. Starting
at 1000, we examine numbers of either of these forms, as follows:
1004 = 4 × 251 and 251 ≡ 3 (mod 8);
1007 ≡ 7 (mod 8), so 1007 is not a sum of three squares;
1008 = 4^2 × 63 and 63 ≡ 7 (mod 8), so 1008 is not a sum of three squares;
1012 = 4 × 253 and 253 ≡ 5 (mod 8);
1015 ≡ 7 (mod 8), so 1015 is not a sum of three squares.
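As an independent check that does not rely on the form 4^n(8m + 7), one can search directly for representations as a sum of three squares (zero allowed as a square). This brute-force sketch uses hypothetical function names.

```python
from math import isqrt

def is_sum_of_three_squares(n):
    """True if n = a^2 + b^2 + c^2 for some integers 0 <= a <= b and c >= 0."""
    for a in range(isqrt(n) + 1):
        for b in range(a, isqrt(n - a * a) + 1):
            c2 = n - a * a - b * b
            c = isqrt(c2)
            if c * c == c2:
                return True
    return False

def first_non_sums(start, count):
    """The first `count` integers exceeding `start` that are not sums of three squares."""
    found, n = [], start + 1
    while len(found) < count:
        if not is_sum_of_three_squares(n):
            found.append(n)
        n += 1
    return found

print(first_non_sums(1000, 3))   # [1007, 1008, 1015]
```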
The triangular numbers are T_r = r(r + 1)/2, for r ≥ 1. So if
n = T_r + T_s + T_t then

n = r(r + 1)/2 + s(s + 1)/2 + t(t + 1)/2.

So
8n + 3 = 4r(r + 1) + 4s(s + 1) + 4t(t + 1) + 3
       = (2r + 1)^2 + (2s + 1)^2 + (2t + 1)^2
which confirms that 8n + 3 is a sum of three squares.


The converse also holds. Starting with this expression for 8n + 3 as the
sum of three squares (which must all be odd) we reverse the algebraic
steps and recover the expression for n as a sum of three triangular
numbers.


An integer of the form 8n + 3 is odd and is congruent to 3, not 7, modulo 8,
so it is not of the form excluded by Theorem 4.4. That theorem therefore has
the corollary that every positive integer is a sum of three triangular
numbers.
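The correspondence between triangular numbers and odd squares can be explored numerically. The sketch below makes one assumption that should be stated loudly: it allows T_0 = 0 as a triangular number, since small cases such as n = 1 and n = 2 need it. The function names are hypothetical.

```python
from math import isqrt

def T(r):
    """The r-th triangular number, with T(0) = 0 allowed (an assumption)."""
    return r * (r + 1) // 2

def three_triangular(n):
    """Return (r, s, t) with T(r) + T(s) + T(t) == n, or None if there is none."""
    r = 0
    while T(r) <= n:
        s = r
        while T(r) + T(s) <= n:
            rest = n - T(r) - T(s)
            # rest is triangular exactly when 8*rest + 1 is a perfect square.
            root = isqrt(8 * rest + 1)
            if root * root == 8 * rest + 1:
                return (r, s, (root - 1) // 2)
            s += 1
        r += 1
    return None

for n in (1, 2, 10, 1000):
    r, s, t = three_triangular(n)
    # The identity from the solution: 8n + 3 is a sum of three odd squares.
    print(n, (r, s, t), 8 * n + 3 == (2*r + 1)**2 + (2*s + 1)**2 + (2*t + 1)**2)
```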




INDEX 


algebraic numbers 19 
cycle length 7 
descent step 20 
Euclid’s Elements 13 


Fermat’s Last Theorem 18 
fundamental solution 9 


hypotenuse 12


ideal prime divisors 19 
Important Identity for four squares 33 
Important Identity for two squares 25 




Lagrange’s Four Square Theorem 33 
method of infinite descent 20 


parity 13 

Pell’s equation 5 

primitive Pythagorean triple 12 
Pythagorean equation 12 
Pythagorean triple 12 


regular primes 19 
ring 19 


sums of squares 24 
sums of three squares 32 


